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The  host  of  the  Eleventh  Conference  on  Applied  Nathenatics  and 
Computing  was  the  Center  for  Nonlinear  Analysis  at  the  Carnegie- 
Mellon  University,  Pittsburgh,  P.A.  It  was  held  on  Jtine  8-10, 
1993.  Professor  Morton  E.  Gurtin,  the  Principal  Investigator  of 
this  Center,  served  as  Chairperson  on  local  arrangements.  He  was 
assisted  in  this  task  by  Ms.  Francine  Johnson.  We  would  like  to 
take  this  opportunity  to  thank  these  individuals  for  all  the  time 
and  work  preparing  for  and  conducting  this  extremely  well  managed 
scientific  meeting. 

The  1993  conference  was  attended  by  more  than  70  scientists  and 
engineers  representing  various  Army  agencies  and  academe.  The 
meeting  featured  five  plenary  talks  and  four  special  sessions  on 
topics  of  current  interest,  such  as  Mathematics  of  Materials, 
topics  in  stochastic  Analysis,  Computational  issues  in 
Geosciences,  and  Virtual  Factory.  In  addition  there  were  37 
contributed  papers  presented.  The  names  of  the  Invited  speakers 
and  the  titles  of  their  addresses  are  listed  below. 
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Technology 
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Controls 

The  Role  of  Hard  Thermal 
Convection  in  Geosciences 

Stocastic  Spatial  Models  of 
Epidemics  and  Excited>le  Media 


Many  of  the  papers  given  at  this  conference  provided  the 
attendees  a  chance  to  see  scientific  techniques  developments 
taking  place  in  the  Army  laboratories.  Through  these  meetings 
techniques  developed  at  one  Installation  are  brought  to  the 
attention  of  scientists  at  other  places,  thereby  reducing 
duplication  of  effort.  Another  important  phase  of  these  meetings 
is  presenting  the  members  of  the  audience  an  opportunity  to  hear 
nationally  known  scientists  discuss  recent  developments  in  their 
own  field. 
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ABSTRACT.  We  introduce  some  of  the  concepts  of  computational  complexity  theory.  We 
then  survey  available  complexity-theoretic  results  for  some  of  the  miun  problems  of  control  theory. 

INTRODUCTION.  Our  subject  is  motivated  by  asking  what  it  means  “to  have  a  solution  to  a 
problem.”  The  answer  to  this  question  has  been  changing  with  time:  for  example,  ancient  Greeks 
required  a  constructive  procedure  using  ruler  and  compass,  classical  mathematicians  required  closed 
form  formulas  (maybe  involving  special  functions),  etc.  The  theory  of  computation,  as  developed 
during  the  last  fifty  years  considers  a  problem  to  be  solved  if  an  algorithm  is  provided  that  can 
compute  its  solution.  There  do  exist  problems  that  are  provably  unsolvable,  such  as  the  halting 
problem  or  Diophantine  equations.  Even  problems  that  are  solvable  in  principle,  as  most  problems 
in  control  theory  are,  can  be  of  varying  difficulty  if  one  considers  the  required  computational 
resources,  such  as  running  time.  Complexity  theory  is  the  branch  of  computer  science  that  deals 
with  the  classification  of  problems  in  terms  of  their  computational  difficulty.  In  this  paper,  we  define 
some  concepts  from  complexity  theory  and  provide  an  overview  of  existing  complexity  results  for 
several  problems  in  control  theory. 

COMPLEXITY  THEORY.  Mainstream  complexity  theory  deals  with  discrete  problems;  that 
is,  problems  whose  instances  can  be  encoded  in  a  finite  binary  string  and  are  therefore  suitable 
input  to  a  digital  computer.  Problems  (such  as  matrix  inversion)  have  instances  (any  particular 
square  matrix  defines  an  instance  of  matrix  inversion).  Different  instances  of  the  same  problem 
are,  in  general,  of  different  sizes,  where  “size”  means  the  number  of  bits  used  in  a  natural  encoding 
of  that  instance.  We  say  that  a  problem  is  polynomial  time  solvable,  or  for  short,  belongs  to  P, 
if  there  exists  an  algorithm  and  some  integer  k  such  that  the  time  it  takes  this  algorithm  to  solve 
any  instance  of  size  n  has  order  of  magnitude  0(n*).  Some  classical  problems  in  P  are  linear 
programming,  matrix  inversion,  the  shortest  path  problem  in  graphs,  etc.  It  is  often  considered 
that  problems  in  P  are  the  “well-solved”  ones. 

There  is  another  class  of  problems,  called  NP  (for  nondeterministic  polynomial  time),  that 
contains  all  problems  that  can  be  “transformed”  or  reformulated  as  integer  programming  problems. 
While  P  is  a  subset  of  NP,  there  is  no  known  polynomial  time  algorithm  for  integer  programming 
and  it  is  generally  conjectured  that  P  ^  NP.  If  this  conjecture  is  true,  then  integer  programming 
is  not  solvable  in  polynomial  time  and  the  same  is  true  for  those  problems  in  NP  which  are  the 
“hardest”;  such  problems  are  called  ATP-complete.  More  generally,  we  will  say  that  a  problem  is 
JVP-hard  if  it  is  at  least  as  hard  as  integer  programming. 

Proving  that  a  problem  is  iVP-hard  is  viewed  as  evidence  that  the  problem  is  difficult.  As¬ 
suming  the  vididity  of  the  conjecture  P  ^  NP,  ATP- hard  problems  do  not  have  polynomial  time 
algorithms.  More  gneraUy,  NP-hardness  often  reflects  a  certiun  absence  of  structure  which  limits 
the  nature  of  theoretical  results  that  can  be  established.  In  practical  terms,  iVP-hardness  usually 
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means  that  a  problem  should  be  approached  differently:  instead  of  trying  to  develop  an  algorithm 
which  can  be  proved  to  work  efficiently  all  of  the  time,  effort  should  be  concentrated  on  easier 
special  cases  or  on  heuristics  that  work  acceptably  well  most  of  the  time;  usually,  this  is  to  be 
determined  by  extensive  experimentation  rather  than  by  theoretical  means. 

DECENTRALIZED  DECISION  MAKING.  Witsenhausen’s  problem  [W68]  is  the  simplest 
conceivable  generalisation  of  linear  quadratic  Gaussian  control  (LQG)  to  a  nonclassical  informa¬ 
tion  pattern  (decentralized  control).  The  solution  to  this  problem  has  remained  unknown  despite 
persistent  efforts.  An  explanation  is  provided  by  the  fact  that  this  problem,  suitably  discretized, 
is  iVP-hard  [PT86].  The  point  here  is  not  that  we  might  wish  to  solve  Witsenhausen’s  problem 
computationally;  iVP-hardness  is  an  indication  that  the  problem  is  fundamentally  more  difficult 
—  and  less  structured  —  than  its  centralized  analog  (LQG). 

The  above  result  is  not  fuUy  satisfactory  because  it  does  not  rule  out  the  possibility  that 
iVP-hardness  is  only  a  consequence  uf  the  problem  discretization,  not  of  the  inherent  difficulty 
of  the  original  problem.  While  it  seems  difficult  to  establish  a  complexity  result  for  the  original 
(continuous)  version  of  the  problem,  results  similar  to  iVP-hardness  have  been  established  for  a 
related  problem  which  we  discuss  below. 

In  the  team  decision  problem  [MR72],  we  are  given  two  random  variables  yi,yj,  with  known 
joint  probability  distribution,  and  a  cost  function  c  :  SI.  Agent  i  (i  =  1,2)  observes  the  value 

of  yi  and  makes  a  decision  according  to  a  rule  Uj  =  7,(y<).  A  cost  of  c(yi,y2,Ui,U2)  is  then 
incurred.  The  problem  is  to  find  rules  7i  and  72,  so  as  to  minimize  the  expectation  of  the  cost. 
This  problem  is  iVP-hard  for  the  case  where  the  y^s  have  finite  range  [PT82,TA85].  It  remains 
iVP-hard  even  for  a  special  case  that  arises  in  decentralized  detection  [TA85]. 

In  the  continuous  version  of  the  problem,  we  take  the  random  variables  yj  and  y2  to  be 
uniformly  distributed  on  [0,1].  The  function  c  is  assumed  to  be  Lipschitz  continuous  with  known 
Lipschitz  constant.  Such  a  fimction  cannot  be  represented  by  a  binary  string,  as  required  by  digital 
computers,  and  for  this  reason,  we  need  a  suitably  adapted  model  of  computation;  we  assume  that 
a  digital  computer  obtmns  information  on  the  fimction  c  by  submitting  queries  to  an  “oracle”;  a 
typical  query  consists  of  a  rational  (hence  finitely  describable)  vector  in  the  domain  of  c,  together 
with  an  integer  k;  the  oracle  returns  the  k  most  signific2int  bits  of  the  value  of  c  at  that  point. 
Finally,  there  is  an  accuracy  peirameter  c;  instead  of  looking  for  an  optimal  solution,  we  only  desire 
a  solution  whose  expected  cost  comes  within  e  of  the  optimal.  The  “cost”  or  running  time  under 
this  model  of  computation  can  be  viewed  as  consisting  of  two  parts:  a)  the  number  of  queries 
(information  cost)  and  b)  the  time  spent  on  actual  computations  (combinatorial  cost). 

For  the  continuous  version  of  the  team  decision  problem,  the  cost  of  information  is  only  0(l/€*); 
it  suffices  to  learn  the  value  of  the  function  c  at  points  on  a  grid  with  spacing  e  and  £tny  smaller 
number  of  queries  is  insufficient.  On  the  other  hand,  assuming  that  P  ^  iVP,  there  exists  no 
algorithm  that  solves  the  problem  with  accuracy  e  in  time  polynomial  in  l/e  [PT86]. 

Two  remarks  are  in  order: 

a)  For  many  problems  whose  complexity  has  been  studied  within  this  framework,  the  information 
cost  and  the  combinatorial  cost  are  comparable.  Examples  can  be  found  in  numerical  integra¬ 
tion,  numerical  integration  of  PDEs,  nonlinear  programming,  etc.  In  contrast,  we  have  here 
an  exponential  gap  between  the  two  types  of  costs. 

b)  Problems  with  closed  form  solutions  often  have  complexity  which  is  polynomial  in  the  logarithm 
of  1/e.  Even  problems  like  PDEs  have  complexity  of  the  order  of  1/e*,  where  the  exponent  k 
depends  on  the  dimension  of  the  problem  and  the  smoothness  of  the  data.  In  this  respect,  the 
team  decision  problem  is  significantly  harder  than  most  PDE  problems. 

MARKOV  DECISION  THEORY. 
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Consider  a  controlled  Markov  chain  x(f)  that  takes  values  in  the  finite  set  {1, . . . ,  n}.  We  are 
given  its  transition  probabilities  p{x{t  +  1)  |  e(t),u(t)),  where  u(t)  is  the  control  iq>plied  at  time 
t.  The  cost  per  stage  is  of  the  form  c(e(t),u(t))  and  there  is  a  ^count  factor  a  €  (0,1).  The 
objective  is  to  minimise  the  infinite  horison  expected  cost 

t=0 

over  all  causal  control  policies.  The  key  to  solving  this  problem  is  Bellman’s  equation: 

V(i)  =  imn[c(t,tt)  +  «  X]  I  *>  “)^(i)] 

i 

While  there  are  several  algorithms  for  solving  Bellman’s  equation  (e.g.,  policy  iteration,  suc¬ 
cessive  approximation,  etc.),  none  of  the  algorithms  of  this  type  is  known  to  run  in  poljmomial 
time.  On  the  other  hand,  this  problem  is  polynomially  solvable  because  it  can  be  transformed  to 
linear  programming  [B87]. 

The  problem  becomes  more  difiSicult  if  the  control  is  to  be  chosen  on  the  basis  of  imperfect 
information.  Suppose  that  at  each  time  t,  we  observe  y(t)  =  h(z(t)),  where  h  is  a  known  function. 
We  restrict  to  policies  in  which  the  current  decision  u(t)  is  determined  by  a  rule  of  the  form 
u(t)  =  -F(y(0),p(l),...,y(<),t).  If  we  let  p(i,t)  =  PT(x(t)  =  i  |  past  history),  the  problem  can 
be  reformulated  as  a  perfect  information  problem  with  state  vector  p(t)  =  (p(l,t),...,p(n,t)). 
The  cost-to-go  function  V'(p(t))  is  known  to  be  piecewise  linear  in  p  and  this  leads  to  a  finite 
algorithm  [SS73],  at  least  for  the  case  of  finite-horizon  problems.  Unfortunately,  the  number  of 
linear  Taces”  can  increase  exponentially  with  the  time  horizon,  and  so  does  the  required  algorithm. 
Are  more  efficient  algorithms  likely  to  exist?  The  answer  is  probably  negative  because  the  problem 
is  NP-hard  (PT87]. 

There  are  analogs  of  these  results  that  apply  to  the  problem  of  supervisory  control  of  discrete- 
event  systems,  as  formulated  by  Ramadge  and  Wonham  [RW87].  These  problems  are  similar 
to  the  problems  of  Markov  decision  theory  except  that  the  transition  probabilities  are  not  given 
(or  may  not  exist)  and  the  problem  consists  of  finding  feedback  laws  that  are  guaranteed  to  avoid 
certain  imdesirbale  states.  While  the  perfect  information  problem  was  known  to  be  polynomial,  the 
corresponding  imperfect  information  problem,  as  well  as  the  corresponding  problem  of  decentralized 
control,  are  NP-hard  [T88]. 

The  Markov  decision  problem  hets  also  been  extensively  studied  for  the  case  of  continuous  state 
spaces.  In  a  simple  version  of  the  problem,  we  may  assume  that  the  state  and  the  control  take 
values  in  [0, 1].  Bellman’s  equation  becomes 

V(*)  =  min[c(a:,u)-Ha  /  p(y  j  x,u)V(y)dj/] 

*  •'o 

Let  us  assume  that  the  functions  c  and  p  have  bounded  first  derivatives.  If  we  wish  to  solve  Bellman’s 
equation  with  accuracy  e,  it  is  not  hard  to  show  that  0(1  f^)  “oracle  queries”  suffice;  it  turns  out 
that  this  many  queries  are  also  necessary  [CT89].  The  nattiral  iterative  method  for  this  problem 
(successive  approximation)  has  computational  complexity  0((l/€^)  log(l/6)):  the  cost  per  iteration 
is  0(l/e*)  and  log(l/e)  iterations  are  needed  to  get  within  e  of  the  solution.  In  fact,  the  logarithmic 
gap  between  the  lower  bound  of  0(1/6^)  zmd  the  performance  of  successive  approximation  can  be 
closed.  It  turns  out  that  a  “one-way  multigrid”  method  solves  the  problem  with  a  total  of  0(1 /e*) 
arithmetic  operations  and  is  therefore  an  optimal  algorithm  [CT91].  The  key  idea  is  that  most 
iterations  are  performed  on  coarse  grids  and  are  therefore  relatively  inexpensive. 
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QTHF.'R.  RRSTTLTS.  We  mention  briefly  some  more  problems  in  control  theory  for  which  com¬ 
plexity  theoretic  results  are  available. 

Nonlinew  controllability:  The  question  here  is  whether  it  is  possible  to  generalise  Kalman’s 
controllability  conditions  to  nonlinear  systems.  Consider  a  bilinear  system  of  the  form 


—  =  (A  -1-  ^2  -I-  Bu,  X  G  M, 


where  Af  is  a  manifold  defined  as  the  sero  set  of  a  given  set  of  polynomials.  It  turns  out  that  deciding 
whether  such  a  system  is  controllable  (i.e.,  whether  every  state  can  be  reached  firom  any  other  state) 
is  NP-haxd  [S88].  As  a  result,  whatever  necessary  and  sufilcient  conditions  for  controllability  are 
found  will  be  computationally  very  difiicult  to  test  (unless  P  =  NP). 

Robiut  control:  Consider  the  I’near  system  dx/dt  =  Ax  and  assume  that  A  =  Ao  +  <>1  Ai  -f- 
- 1-  Ok  At ,  where  each  Oj  is  an  unknown  parameter  that  lies  in  [0, 1].  We  are  interested  in  prop¬ 
erties  of  the  plant  that  are  valid  for  all  choices  of  ai,...,ak.  It  turns  out  that  many  interesting 
problems  within  this  framework  are  unlikely  to  have  computationally  efficient  solutions.  For  ex¬ 
ample,  deciding  whether  A  is  guaranteed  to  be  no^^singular  or  stable  is  JVP-hard  [PR92,N92].  As 
a  corollary  of  this  result,  computing  the  structured  singular  value  of  a  linear  system  is  also  an 
JVP-hard  problem  [BYDM92]. 

Simultaneous  stabilisation:  Let  there  be  given  matrices  A^ , . . . ,  At  and  B.  The  problem 
is  whether  there  exists  a  matrix  K  such  that  A,-  —  BK  is  stable  for  aU  :.  This  problem  can  be  also 
shown  to  be  iVP-hard. 

Control  of  queueing  systemss  Consider  the  standard  problem  of  controlling  a  closed  queue¬ 
ing  network  with  several  servers  and  several  customer  classes.  (Control  involves  routing  decisions 
for  customers  that  complete  service  and  sequencing  decisions;  the  latter  deal  with  choosing  which 
customer  to  serve  next  at  any  given  server  with  several  customers  in  queue.)  The  objective  is  to 
find  a  control  policy  that  minimizes  the  expected  werjhted  siun  of  queue  lengths.  This  problem 
is  iVP-hard  even  in  the  special  case  wi:ere  the  service  time  distributions  are  exponential  [PT93]. 
In  addition,  its  computational  complexity  is  exponential  for  the  case  of  deterministic  service  times 
[PT93].  Note  that  the  latter  result  is  stronger  than  zmy  of  the  results  mentioned  earlier  in  this 
paper  in  that  it  does  not  rely  on  the  validity  of  the  conjecture  P  /  NP. 

CLOSING  COMMENTS.  Complexity  theory  can  provide  new  insights  to  control  problems. 
However,  any  results  obtained  have  to  be  interpreted  with  caution.  Proving  that  a  problem  is 
NP-hard  does  not  mean  that  the  problem  is  intractable  and  that  work  on  it  should  be  abandoned. 
Rather,  a  different  line  of  attack  may  be  called  for. 
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ABSTRACT 

Incoherent  phase  transitions  are  more  difficult  to  treat  than  their 
coherent  counterparts.  The  interface,  which  appears  as  a  single  surface  in 
the  deformed  configuration,  is  represented  in  its  undeformed  state  by  a 
separate  surface  in  each  phase.  This  leads  to  a  rich  but  detailed  kinematics, 
one  in  which  defects  such  as  vacancies  and  dislocations  are  generated  by 
the  moving  interface.  In  this  paper  we  discuss  incoherent  phase  transitions 
in  the  presence  of  deformation  and  mass  transport,  neglecting  inertia.  The 
phase  interface  is  presumed  sharp  and  structured  by  energy  and  stress. 
The  final  results  are  a  complete  set  of  interface  conditions  for  an  evolving 
incoherent  interface. 

KINEMATICS 

In  a  coherent  phase  transition  the  body  B  occupies  a  fixed  region  of 
space  in  a  uniform  reference  configuration,  the  individual  phases,  which  we 
label  i=l,2,  occupy  complementary  subregions  Bj(t)  of  B,  and  motions  are 
continuous  across  the  undeformed  phase  interface  S(t)  =  B],(t)nB2(t).  As  is 
clear  from  the  statical  treatments  of  Cahn  and  Larch6  [1982],  Larch6  and 
Cahn  [1985],  and  Leo  and  Sekerka  [1989],  incoherent  phase  transitions  are 
far  more  complicated.  The  interface,  which  appears  as  a  single  surface  in 
the  deformed  body,  is  represented  in  its  undeformed  state  by  a  separate 
surface  Sj(t)  for  each  phase  i,  even  though  we  choose  uniform  reference 

^Supported  by  the  U.  S.  Army  Research  Office.  This  paper  presents  a  synopsis  of 
Cermelli  and  Gurtin  [1994a, 1994b] 
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configurations  for  the  two  phases  with  corresponding  reference  lattices  coin¬ 
cident.  Such  complications  lead  to  a  rich  but  detailed  kinematics,  one  in 
which  defects  such  as  dislocations,  vacancies,  and  interstitials  may  be  gen¬ 
erated  by  the  moving  interface.^ 

A  two-phase  motion  is  a  pair  y=(yi,y2):  at  each  time  t,  y^  maps 
material  points  X  in  the  undeformed  region  Bj  for  phase  i  into  points 
x  =  yj(X,t)  in  the  deformed  body.  F  is  then  the  deformation  gradient: 
F=Vyi  in  phase  1,  F  =  Vy2  in  phase  2;  in  addition,  we  denote  by  Fj  the 
limit  of  F  as  the  interface  is  approached  from  phase  i.  Associated  with 
each  two-phase  motion  are  three  basic  kinematical  quantities: 

(1)  The  incoherency  tensor  H,  which  measures  the  stretching  and  twisting 
of  one  phase  relative  to  the  other  at  the  interface.  H  is  the  tangential 
part  of  the  relative  deformation  gradient 

H  =  F2-1Fi.  (1) 

For  any  point  x  of  the  deformed  interface,  H  is  a  linear  transform¬ 
ation  dX2=MdXjL  between  infinitesimal  line  segments  dXj  on  Sj  that 
coincide  at  x  when  deformed.  If,  for  all  such  line  segments,  dX2  =  dX^ 
(or  dX2  =  QdXj^  with  Q  a  symmetry  rotation  of  the  lattice),  then  the 
deformed  lattices  fit  together  and  the  interface  is  infinitesimally 
coherent  at  x. 

(2)  The  production-rate  of  lattice  points,  as  measured  by  the  jump  [W]  in 
the  interfacial  volume  flows  Wj  =  Vj/Jj,  where,  for  each  i,  Vj  is  the 
normal  velocity  of  Sj ,  while  is  the  surface  Jacobian  for  y^ 
considered  as  a  deformation  of  Sj. 

(3)  The  slip,  as  measured  by  the  difference  (y2)°  "  (yi)".  where  (yj)®  is 
the  time  derivative  of  yj  following  the  normal  trajectories  of  Sj(t). 

The  incoherency  tensor,  the  lattice-point  production,  and  the  slip 
completely  characteriHe  incoherency:  an  initially  coherent  motion  is 
coherent  for  all  time  if  and  only  if,  at  each  time,  the  interface  is 
infinitesimally  coherent  and  the  slip  and  lattice-point  production  vanish 
identically. 

^Dislocations  are  discussed  by  Brooks  [19521,  Nye  [19531,  Frank  [19551,  Bilby 
[19551,  Bilby.  Bullough,  and  De  Grinberg  [19641,  Christian  [1965,19851,  Bollman  [19671. 
Christian  and  Crocker  [19801.  Pond  (1985,19891.  Christian  and  Crocker  [19801,  p.  181 
and  Larch4  and  Cahn  [19851,  p.  1587  note  the  possibility  of  vacancies  and  dislocations. 
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THEORY  WITHOUT  INTERFACIAL  STRUCTURE 

The  basic  physical  principles  upon  which  the  theory  is  based  are 
balance  of  forces,  balance  of  mass,  and  a  version  of  the  second  law  of 
thermodynamics  appropriate  to  a  mechanical  theory.^  The  standard 
forces  associated  with  continua  arise  as  a  response  to  the  motion  of  material 
points.  The  mechanical  description  of  a  phase  transition  requires  additional 
forces^  that  act  in  response  to  microstructural  changes  at  the  phase 
interface.  We  refer  to  the  former  as  deformational  forces,  to  the  latter  as 
configurational  forces.^  What  is  most  important  is  that,  in  addition  to 
the  usual  force  and  moment  balances  for  deformational  forces,  we 
postulate  an  additional  balance  for  configurational  forces. 

We  assume  that  there  are  species,  a  =  1,2,...,2(,  of  mobile  atoms 
with  molar  densities  p®  and  corresponding  diffusive  mass  fluxes  h®.^ 
Bulk  fields  that  strongly  influence  the  motion  of  the  interface  are  the 
grand  canonical  potential  co  and  the  Eshelby  tensor  P  defined  by 

CO  =  $  -  2  p®u®,  P  =  col  -  F^S,  (2) 

a»  1 

with  ^  the  bulk  energy,  iJi®  the  chemical  potential  of  species  a,  S  the 
bulk  stress,  measured  per  unit  undeformed  area  (Piola-Kirchhoff  stress), 
and  1  the  unit  tensor. 

The  final  bulk  relations  are  the  balance  laws 

DivS  =  0,  (p®)'  =  -Divh®,  (3) 

supplemented  by  constitutive  equations 

i  =  l^(F,p),  S  =  aFl,(F,p), 
b  =  -D,(F,p)Vn, 


H  =  c)pii(F,p), 


(4) 


^Cf.  Gurtin  [1991]. 

^Cf.  the  discussion  given  in  the  Introduction  of  Gurtin  [1990], 

^We  depart  from  terminology  introduced  in  Gurtin  and  Struthers  [1990]  and 
Gurtin  [1993],  where  the  term  accretive  forces  was  used. 

^Cf.  Gurtin  and  Voorhees  [1993],  Gurtin  [1993]. 
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for  each  phase  i.  with 


p  *  (p^,...,p*).  UL  =  (UL^....UL*).  b  =  (h^,...,h*). 

We  next  consider  appropriate  interface  conditions.  To  best  illustrate 
the  basic  ideas,  we  begin  with  a  theory  that  neglects  interfacial  energy  and 
stress,  but  includes  interface  kinetics.  The  resulting  interface  conditions 
consist  of  an  equation 

[y‘]‘n  =  -[JW]  (5) 

expressing  kinematical  compatibility  at  the  interface,  a  jump  condition 

[y^Sn]  =  0  (6) 

balancing  forces  across  the  interface,  equations 

6,n^.P^n^  =  (p.iWj  +  (7) 

(i=l,2)  balancing  normal  configurational  forces  on  each  phase  at  the  inter¬ 
face,  equations 


(8) 


(i=l,2)  characterizing  the  vanishing  of  the  tangential  traction  in  each  phase 
at  the  interface,  a  relation 


[p®W]  =  [r^h«-n]  (9) 

expressing  mass  balance  for  each  species  a,  and  a  condition  of  local 
equilibrium 


[h«l  =  0  (10) 

for  each  species  a.  Here  h  is  the  unit  normal  to  the  deformed  interface  Z, 
Hj  is  the  unit  normal  to  the  undeformed  phase  i  interface  S^,  [f]  denotes 
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the  jump  in  a  bulk  field  f  across  the  interface,  fj  denotes  the  interfacial 
limit  of  f  from  phase  i,  and  Py  are  kinetic  coefficients. 

In  the  derivation  of  these  interface  conditions  the  slip  was  not 
included  among  the  independent  constitutive  variables,  a  direct  consequence 
of  this  assumption  is  (8).  The  local  equilibrium  condition  (10)  is  an 
assumption  made  from  the  outset.^ 

The  balances  (6)-(8)  can  be  expressed  more  succintly  as  a  normal 
force  balance 

ir^Snl-h  =  0  (11) 

and  a  partial  balance 

=  (p^iWi  +  P.2W2)n,  (12) 

for  each  phase  i. 

THEORY  WITH  INTERFACIAL  STRUCTURE 

We  turn  next  to  a  theory  that  includes  interfacial  energy  and  stress, 
but  neglects  mass  flow  within  the  interface.  Following  Cahn  and  Larch6 
[1982],  we  choose  one  of  the  phases,  phase  1,  as  a  reference  for  the 
interface,  and  measure  interfacial  fields  relative  to  Sj.  Here  it  is  con¬ 
venient  to  refer  to  phase  1  as  the  parent  phase  and  to  phase  2  as  the 
product  phase,  and  to  use  the  abbreviations 

S  =  Sj ,  n  =  n^. 

We  consider  a  single  interfacial  energy  ip,  but  endow  the  interface 
with  three  stress  fields: 

•  a  deformational  stress  S  that  represents  the  (Piola-Kirchhoff)  stress  in 
the  surface  and  acts  in  response  to  the  stret^ning  of  the  parent  interface; 

•  a  configurational  stress  C  that  represents  microstructural  forces  in  the 
parent  interface; 

•  a  configurational  stress  K  that  acts  in  response  to  the  stretching  and 
rotation  of  the  product-phase  lattice  relative  to  the  parent-phase  lattice. 

^Cf.  Gurtin  and  Voorhees  [1994],  who  develop  a  theory  in  which  this  assumption  is 
dropped.  Their  theory  neglects  deformation. 
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A  consequence  of  thermodynamics  is  that  the  tangential  part  of  the  total 
surface  stress 

A  =  C  +  Fi^S  +  (13) 

which  represents  the  net  configurational  contribution  of  the  stresses  to  the 
rate  of  working,  is  a  surface  tension  whose  value  is  the  interfacial  energy  4^. 

Among  the  constitutive  equations  considered  for  the  interface  are 
relations  giving  the  interfacial  energy  ij),  the  surface  stresses  8  and  K, 
and  the  normal  part  B-A^n  of  the  total  surface  stress  as  functions  of  the 
limiting  value  F  =  Fi  of  the  deformation  gradient,  the  limiting  values 
and  P2  of  the  list  of  densities,  the  normal  n  to  S=Si,  and  the  volume 
flows  Wj  and  W2.  Consequence  of  the  second  law  are  that  4^,  8,  K,  and 
B  are  independent  of  p^,  P2.  and  W2;  and  that  the  energy 

^  =  4'(F,H,n)  (14) 

generates  the  stresses  through  the  relations 

8  =  aF4-(F,H,n),  K  =  0H4.(F,H,n),  B  =  -  ;i.(F,H,n).  (15) 

We  show  further  that  ^J;,  §,  and  K  depend  on  F  and  H  through  the 
tangential  deformation  gradient  F  and  the  incoherency  tensor  H,  that 

8  =  0p4;(F,H,n),  K  =  0||(^(F,H,n),  (16) 

and  that  C  =  C’’n  is  given  by 

e  =  -D„;i^(F,H,n),  (17) 

with  D„  the  derivative  following  n. 

The  final  results  —  which  form  a  complete  set  of  conditions  for  an 
incoherent  interface  —  consist  of  the  compatibility  condition  (5),  the  mass 
balance  (9),  the  local  equilibrium  condition  (10),  an  equation 
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(18) 


-  (F'^S  +  H’’K)*L  +  Div3C  -  ii'P^n  =  +  ^12^2 

that  represents  a  normal  configurational  balance  for  phase  1,  an  equation 

DivgK  +  KPjHj  =  KfpjiWj  +  ^22^2)02  (19) 

that  represents  a  configurational  balance  for  phase  2,  a  deformational  force 
balance 


DivsS  +  HS2n2  -  =  0,  (20) 

and  the  constitutive  relations  (16)  and  (17).  Here  K  =  |j/|2.  while 
L  =  Li=-V3n  and  K  =  Ki  =  trL.  respectively,  are  the  curvature  tensor  and 
the  total  (twice  the  mean)  curvature  for  S. 
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Abstract 

A  system  of  equations  modeling  antiplane  shear  in  a  granular  material  is  considered.  The 
model  includes  the  possibility  of  localization  of  strain,  and  the  subsequent  development  of  shear 
bands.  This  behavior  is  captured  in  our  analysis  of  the  Riemann  initial  value  problem,  in  which 
an  initial  discontinuity  propagates  as  a  combination  of  moving  waves  and  a  stationary  shear 
band.  The  analytic  solution  is  used  to  test  numerical  simulations  based  on  Godunov’s  method 
with  front  tracking  aud  adaptive  mesh  refinement. 

1.  Introduction 

We  consider  a  model  [4]  for  dynamic  deformations  of  granular  materials  which  allows  for 
the  loc2ilization  of  flow  and  the  consequent  development  of  shear  bands.  We  focus  on  Riemann 
initial  value  problems  in  one  space  dimension  that  include  a  shear  band.  An  unusual  feature 
of  the  solutions  is  that  they  are  not  scale  invariant.  The  structure  of  the  solution  (shown  in 
Figure  1)  includes  the  feature  that  the  material  unloads  elastically  between  the  shear  band 
and  a  free  boundary  that  propagates  into  a  region  of  plastic  deformation.  Mathematically,  the 
Riemann  problem  reduces  to  solving  a  free  boundary  problem  for  the  (linear)  wave  equation.  We 
summarize  short  time  existence  results  and  long  time  behavior,  the  details  of  which  are  given 
in  a  series  of  papers  [2,  5].  Then  we  outline  the  governing  principles  in  designing  an  efficient 

^Research  supported  by  NSF  grant  DMS  9201115,  which  includes  funds  from  AFOSR,  and  by  ARO  grant 
DAAL03-91-G-0122. 

^Research  supported  by  NSF  grant  DMS  9201115,  which  includes  funds  from  AFOSR. 

^Research  supported  by  NSF  grant  DMS  9201034,  which  includes  funds  from  AFOSR. 

^Research  supported  by  NSF  gramt  SES-DMS-9201361,  by  DNA  grant  DNA001-92-C-0166,  and  by  DOE  grant 
DE-FG05-92ER25145 
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numerical  simulation  code,  and  present  numerical  results.  The  numerical  results  are  in  excellent 
agreement  with  the  predictions  of  the  analysis.  The  simulation  code  is  designed  so  that  it  can 
generalize  to  higher  dimensions;  it  is  currently  being  extended  to  two  dimensions. 

The  following  system  of  equations  describes  antiplane  shear  deformations  that  depend  on  one 
space  variable  x  and  time  t  [4]. 


=  dxa 

(«) 

dxV  \ 

0  J 

1  (if  loading) 

(6) 

A 

dxV  ^ 
0 

1  (otherwise). 

(c) 

The  dependent  variables  v  and  T  =  (<t,  t)^  represent  velocity  and  stress  respectively,  all  other 
entries  of  the  full  stress  tensor  being  constant.  is  the  transpose  of  T.  The  constant  c  is  the 
elastic  wave  speed,  /  is  the  2x2  identity  matrix  and  R  is  the  rotation  matrix 


(cos 
-si 


cos  ^  sin  j3 
~  sin  cos  ^ 


with  parameter  /9  €  (0,  |)  measuring  the  degree  of  nonassociativity  in  the  model  (specifically  in 
the  flow  rule).  The  given  function  h  =  h{'y)  is  the  hardening  modulus,  depending  on  the  yield 
stress  7.  The  function  h  is  nonnegative  and  monotonically  decreasing  on  [0,1],  with  h{l)  =  0. 

Equation  (1.1a)  is  conservation  of  momentum,  while  equations  (l.lb,c)  specify  constitutive 
behavior.  This  behavior  is  described  as  loading  (or  plastic)  when  \T\  is  at  its  maximum  over 
previous  time  and  is  increasing.  That  is,  the  material  is  loading  when  it  is  at  plastic  yield: 

|r(x,t)|  =7(x,t)  =  maxjr(i,s)|,  (1.3) 

and  the  right  hand  side  of  (1.3)  is  increasing: 

dn  >  0.  (1.4) 

When  |T(x,  t)|  <  7(x,  t),  the  behavior  is  described  as  elastic.  When  (1.3)  holds  and  dt'y  =  0,  we 
use  the  terminology  neutral  loading. 

We  have  a  differential  equation  for  7  : 


dti  = 


=  {o 


\T\  if  loading 
otherwise 


The  Riemann  problem  for  system  (1.1)  is  the  initial  Value  problem  with  initial  data  of  the 
form 

til  T  7l)  if  X  <  0  ti 


subject  to  (r(x,0)|  <  7(x,0).  In  solving  Riemann  problems  for  a  simplified  version  of  system 
(1.1),  Garaizar  [1]  foimd  initial  data  (1.6)  for  which  there  is  no  scale  invariant  solution.  The 
reason  is  that  equations  (1.1)  lose  hyperbolicity  as  7  approaches  a  critical  value  7  =  7*,  and  the 
classical  construction  of  scale  invariant  solutions  breaks  down. 

When  the  equations  lose  hyperbolicity,  they  also  lose  linear  well-posedness,  amd  we  suppose 
that  a  shear  band  forms.  As  in  [4],  we  treat  the  shear  band  as  a  stationary  discontinuity,  with 
nonstandard  jump  conditions.  Specifically,  the  velocity  gradient  is  approximated  by  a  divided 
difference: 

Vz  w  lv]/6,  (1.7) 

where  (u]  is  the  jump  in  velocity  across  the  shear  band  and  ^  is  a  small  parameter  to  the  grain  size. 
From  the  conservation  of  momentum  equation  (l-la),  we  see  that  <7  is  continuous  across  a  shear 
band.  The  variable  r  experiences  a  jump  on  each  side  of  the  shear  band.  The  approximation  (1.7) 
leads  to  the  following  system  of  ordinary  differential  equations,  with  a  constraint,  for  evolution 
of  the  variables  T  =  (<r,  r)^,7  within  the  shear  band. 

['  +  (  T  )  ■  ^  =  1^1-  (1.8) 

Note  that  these  equations  are  coupled  to  the  external  variables  through  the  jump  [u]  in  v, 
and  through  the  continuity  of  <t.  This  system  of  equations  may  be  regarded  as  a  jump  condition 
for  stationary  shocks,  analogous  to  the  usual  Rankine-Hugoniot  condition  for  shocks.  However, 
the  Rankine-Hugoniot  condition  is  a  system  of  algebraic  equations,  in  contrast  to  (1.8),  which  is 
a  system  of  differential  equations.  The  jump  condition  (1.8)  effectively  widens  the  class  of  weak 
solutions  of  equations  (1.1)  beyond  the  class  of  solutions  whose  jump  discontinuities  satisfy  the 
usual  Rankine-Hugoniot  conditions.  It  is  within  this  wider  class  that  we  shall  seek  solutions  of 
Riemann  problems.  Note  that  system  (1.8)  is  not  scale  invariant,  because  of  the  right  hand  side. 
Correspondingly,  solutions  of  Riemann  problems  also  fail  to  be  scale  invariant. 

In  Section  2,  we  extend  results  of  Garaizar  [1]  concerning  the  Riemann  problem,  to  the  case 
in  which  a  shear  band  forms.  In  Section  3,  we  describe  numerical  results  in  a  test  caise  that  shows 
how  the  computations  agree  with  the  theoretical  predictions. 

2.  Analytic  Solution  of  the  Riemann  Problem. 

In  this  section,  we  review  and  extend  results  of  Garaizar  [1]  concerning  Riemann  problems 
for  system  (1.1).  In  Subsection  2.1,  we  summarize  short  time  existence  and  asymptotics,  which 
are  used  in  Subsection  2.2  to  extend  the  solution  globally  in  time.  The  analysis  applies  to 
a  simplified  version  of  systems  (1.1, 1.8),  in  which  we  linearize  the  yield  condition  (1.3),  and 
work  with  perturbations  of  the  original  variables  about  the  point  at  which  system  (1.1)  loses 
hyperbolicity.  The  simplified  version  of  equation  (1.1)  is 
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(2.1) 


dtv 

=  dxff 

(a) 

dta  + 

^(7) 

=  <^dxV 

W 

dtT  - 

^a«7 

Hv 

=  0 

(c) 

(  dt((T  +  ar)  if  loading 

d«7 

= 

id) 

0  otherwise, 

Here,  a  =  tan  h{‘y)  is  a  positive  strictly  decreasing  function  on  an  interval  containing  the  point 

(2.2) 


7  =  0,  and 


h{Q)  = 


Note  that  in  the  simplified  equations,  we  use  the  same  symbols  a,  t  and  7  to  denote  perturbations 
of  the  original  stresses  <r,  t  and  yield  stress  7. 


2.1  Short  time  behavior. 

Let  U  =  r)^,  and  write  system  (2.1a, b,c)  in  the  loading  case  (in  which  7  =  <t  +  or)  in 

the  form 

Ut  +  BU,  =  0,  (2.3) 


where 


B  = 


1 


0  h-a^  +  l  0\ 
<?{h  -  a^)  0  0 

oi<?  0  0  y 


/i  —  +  1 

Characteristic  speeds  of  (2.3)  are  eigenvalues  of  B,  given  by 

A±  =  ±Cy/Tj,  Ao  =  0, 


(2.4) 


(2,5) 


where  rj  =  {h  —  a^)/{h  —  +  1).  The  associated  eigenvectors  are  (respectively) 

r±  =  -  v)V ,  tq  =  (0,0,1)'^.  (2.6) 

We  conclude  that  system  (2.3)  is  hyperbolic  if  and  only  if  r/  >  0.  Therefore,  (2.3)  is  hyperbolic 
if  and  only  if  7  <  0. 

Next  we  describe  the  values  of  the  variables  within  a  rarefaction  wave  near  7  =  0.  Let 
t;o,cro,To  =  — a"*<7oi7o  =  0  be  the  values  of  the  variables  at  x  =  0  in  a  right  moving  rarefaction 
wave  whose  trailing  edge  has  speed  zero.  Then  (cf.  [5])  we  have  the  following  expressions  for  the 
variables  near  x  =  0,  in  which  ^  =  xjt : 
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7  =7({) 


=  -^ +  {*/■«’).  (<•) 
V  =  t;(0  =  Vo  +  V3(^  +  (ft) 

<r  =  <To-<T4f*  +  ^*/3(^^),  (c) 


(2.7) 


where 


r  =f(0  =ro- 
2 


akic^ 


Vs  = 


Sa^/iiC*’  2a^hiC*' 

and  the  functions  /,,  t  =  1,  ..,4  are  real  analytic  near  the  origin  if  h  is  real  analytic. 

The  other  ingredient  we  require  is  an  integrated  form  of  the  follwing  simplified  form  of  equa¬ 
tions  (1.8): 


e+(*Me),  id) 

hi  =  h\0)  <  0, 


(2.8) 


"""ft 

(a) 

j(^,a,7  =0 

(ft) 

(2.9) 

7  =  <7  -1-  az. 

(c) 

Lemma  2.1  Suppose  h{'f)  is  real  analytic  in  a  neighborhood  0/7  =  0,  ft(0)  =  ^  0,  and 

/i'(0)  =  —hi  <  0.  Let  (of,2:,7,  [v])(<)  satisfy  equations  (1.8),  and  (a,z,7)(0)  =  (<To,  — (To/a,0)  for 
some  Go-  Then  there  is  a  function  b  =  6(7)  that  is  real  analytic  in  a  neighborhood  0/7  =  0  such 
that 

6(0)  =  (To,  ft'(0)  =  0,  6"(0)  =  -aVfti,  (2.10) 

and  with  the  property  that 

<7(i)  =  6(^^jf'HW<»).  (2.11) 


Now  we  are  ready  to  consider  the  central  problem.  Consider  the  Riemann  problem  with 
initial  data 


{  Ul  if  a:  <  0 


t/(x,0)  = 


(2.12) 


[Ur  if  X  >  0 

for  which  there  is  no  centered  solution  of  system  (2.1)  involving  only  shocks  and  rarefactions. 
Specifically,  consider  a  combination  of  left  moving  shocks  and  rarefactions  such  that  the  value 
of  U  to  the  left  of  this  combination  is  Ul  and  the  trailing  edge  of  the  rarefaction  has  zero  speed 
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(i.e.,  7  =  0.).  Similarly,  consider  a  combination  of  right-moving  waves,  with  Ur  on  the  right, 
and  zero  speed  on  the  left.  Let  (<T°,n°),  denote  the  values  of  (<t,  n)  on  the  right  and  left 

(respectively)  of  the  left  and  right  moving  wave  groups  (see  Figure  2).  In  the  so-called  symmetric 
case,  in  which  <t°  =  <r°,  there  is  no  classical  solution  of  the  Riemann  problem  if  v°  >  Vf.  (That  is, 
there  is  no  solution  involving  only  shocks  and  rarefactions.)  This  is  the  situation  that  we  treat. 

Since  there  is  no  classical  solution,  we  explore  the  possibility  of  solving  the  initial  value 
problem  by  including  a  shear  band.  In  such  a  solution,  v  (in  addition  to  r  and  7)  can  experience 
a  jump  across  the  t-axis.  In  contrast  with  the  classical  solution,  the  solution  with  a  shear  band 
is  non-constant  on  the  t  axis  because  of  equations  (1.8),  and  the  overall  solution  does  not  enjoy 
the  property  of  scale  invariance.  To  start  with,  consider  the  equation  (1.8)  on  the  sheajr  band, 
which  is  located  on  the  <-axis.  To  understand  the  short-time  behavior,  we  rescale  time  ^  by  a 
small  constant:  =  t/e.  Then  dt  =  \dt',  so  that  the  equations  (1.8)  are  unchanged  apart  from 
an  c  multiplying  the  right  hand  side.  Then  if  e  is  small  compared  with  8,  we  effectively  have 
scale  invariant  equations;  viz., 


dfCT  -I-  =  0 

^(7) 

dt'T  -  =0,  '1  =  <J  +  CIT. 

hiV 


(2.13) 


It  follows  that  a,  t  and  7  are  constant  in  time  to  leading  order  in  c.  Thus  for  small  time, 
specifically,  for  times  that  are  small  compared  to  6,  the  solution  is  approximately  scale  invariant. 
In  this  solution,  there  is  a  rarefaction  wave  on  the  left  and  right  extending  up  to  the  t-axis.  The 
blovm-up  solution  is  shown  in  Figure  2. 

We  modify  this  picture  for  larger  times  as  follows.  The  solution  is  no  longer  scale  invariant, 
and  the  material  unloads  away  from  the  shear  band.  We  therefore  postulate  an  unloading  or 
relief  wave  propagating  away  from  the  shear  band  and  interacting  with  the  rarefaction.  The 
conjectured  structure  is  shown  in  Figure  1. 


In  Figure  1,  the  solution  is  scale  invariant  outside  the  region  bounded  by  the  relief  waves, 
and  agrees  with  the  blown-up  solution  there.  We  therefore  have  a  pair  of  coupled  quarter-plane 
problems  to  solve  in  the  regions  Et,  Er  of  Figure  5,  with  the  boundaries  in  the  (a:,<)  plane  being 
the  two  relief  waves  and  the  f-axis.  Note  that  regions  E(,  Er  have  t  <  T,  where  T  is  chosen  so 
that  the  relief  wave  has  not  completely  penetrated  the  rarefaction  by  time  T. 

Finding  the  solution  in  regions  Et,  Er  reduces  to  solving  a  Goursat-type  free  boundary  prob¬ 


lem  for  the  wave  equation 


dtv  =  dxCr  (a) 

dta  =  c^dxV  (b) 


(2.14) 
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Figure  1 .  Solution  of  the  Riemann  Problem  in  the  Symmetric  Case. 


in  the  planar  domain 


{(x,t) :  f  >  0,  0  <  X  <  s(<)}. 


(2.15) 


At  the  free  boundary,  (2.14)  is  subject  to  two  boundary  conditions  of  Dirichlet  type,  correspond¬ 
ing  to  the  rarefaction  wave: 


u(s(t),t)  =v{s{t)/t)  (a) 

<7(s(<),<)  =a{3{t)/t).  (6) 


At  {x  =  0},  system  (2.14)  is  subject  to  the  nonlinear,  integral  condition 

<T(0,f)  =  b  , 


(2.17 


derived  from  Lemma  2.1.  (Note  that  by  adding  a  constant  to  v,  we  may  take  [y]  =  2v  without 
loss  of  generality.)  Substituting  d’Alembert’s  solution,  written  in  the  form 

f  =  +  j,  0<»<«(<)  (2.18) 

into  the  boundaury  conditions,  we  obtain  the  following  three  equations  for  the  three  unknown 
functions  F,G,s: 

F{ct  +  s(<))  +  G{ct  -  a{t))  =  v{s{t)lt)  (a) 

f\<A  +  s{t))  -  G{ct  -  3{t))  =  (6)  . 

F{ct)-G{ct)  =c-n\^^J^{F{ce)  +  Gice))de\  (c) 

We  can  derive  short  time  asymptotic  solutions  of  these  equations,  with  the  following  result, 
in  terms  of  the  physical  variables  ([5]): 

Proposition  2.1  For  0  <  x  <  s(t),  and  near  t  =  0,  the  functions  have  the  follomng  expansions, 
uniform  in  x. 

v{x,t)  =  vq  +  2cAt^^^  +  0{t^)  <7(1, t)  =  cTo  +  +  0(15) 

4/3  4/3  (2.20) 


4/3  4/3 

t(x,1)  =  To  -  +  O(l^)  'y{x,t)  = x^^^  +  O(l^), 

h\ClC  ll\ 


where 


A  =  >0  B  =  -st(<T4  +  =  -2a^hiC*  <  0.  (2.21) 

In  reference  [6],  we  prove  that  there  is  a  solution  for  short  time  that  agrees  with  the  asymptotic 
form  we  have  found.  The  proof  is  based  on  a  form  of  the  implicit  function  theorem  that  requires 
the  functions  to  be  real  analytic. 


2.2  Long  time  behavior. 

Once  the  local  existence  (for  t  <  to)  of  solutions  has  been  established,  we  can  extend  these 
solutions  to  all  values  of  1  using  an  iterative  method.  That  is,  we  can  show  that  if  solutions  exist 
for  0  <  1  <  1'  then  these  solutions  can  be  extended  to  a  larger  interval  0  <  1  <  1'  +  e  where  e 
depends  only  on  Iq. 

Let  us  define  a  new  function  r(l)  to  replace  the  unknown  s(l).  This  function  is  defined 
implicitly  by 

sit's 

rict  +  s{t))  = (2.22) 

V 

Then  equations  (2.19)  become: 

(a)  F  +  G  o  <f)  =  v{r{t)) 

(b)  F-Go4>  =  c-*d-(r(l)y  (2.23) 

(c)  F-G  =  c-'b(3f.}t(.F  +  G)dt'), 
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where  ^i)  =  (c  —  r(<))</(c  +  r{t)).  We  rearrange  equations  (2.23)  as  follows: 

(a)  2Fit)  =  (t;  +  c-*a)(r(0) 

(b)  2G(f^<)  =  (e  -  c-V)(r(())  (2.24) 

(c)  G«)  =  f(()-c-4(J^jJ''(f  +  G)<«') 

Since  r(t)  >  0,  we  notice  that  the  argument  of  G  on  the  LHS  of  (2.24b),  is  strictly  less 

than  <0  for  all  i  in  [0,  <o]-  Therefore  we  can  solve  (2.24b)  for  r{t),  for  values  of  t  slightly  larger 
than  to-  That  is,  we  can  extend  r(t)  to  the  interval  [0,  to  +  c],  where  c  is  determined  from  the 
inequality  t  —  to  <  if  r(t)  <  c.  Now  we  can  use  equations  (2.24a)  and  (2.24c)  to  extend 

F  and  G  to  the  same  interval. 

This  process  can  be  repeated  indefinitely  to  obtain  a  solution  to  (2.23)  for  all  t  >  0,  provided 
that  the  value  of  e  does  not  decrease;  in  particular,  if  e  depends  only  on  the  value  of  to  given 
by  the  local  existence  results.  This  will  be  true  if,  for  all  values  of  t,  (i)  r(t)  <  c  and  (ii)  r(t)  is 
monotone  increasing.  The  following  Proposition,  proved  in  [2]  provides  these  properties. 

Proposition  2.2  If  equations  (2.23)  admit  a  solution  (F,G,r)  on  some  interval  [0,r],  then  each 
of  these  functions  is  monotone  increasing  on  this  interval,  and  r(f )  <  c. 

Combining  this  analysis  with  the  local  existence  results  of  [5]  we  have  established  the  existence 
of  solutions  for  all  t  >  0. 

3.  Numerical  Results. 

In  this  section  we  will  explain  the  niunerical  approach  to  solving  system  (1.1).  A  similar 
(but  simplified)  approach  will  apply  to  system  (2,1)  with  a  linear  yield  condition.  Instead  of 
describing  the  numerical  algorithm  in  detail  we  will  address  some  of  the  numerical  difficulties 
which  arise  from  the  properties  of  the  system. 

3.1  Elasto-plastic  transition. 

The  first  difficulty  in  designing  a  numerical  code  is  that  the  equations  in  system  (1.1)  change 
depending  on  whether  the  material  deforms  elastically  (1.1c)  or  plastically  (1.1b).  In  the  nu¬ 
merical  algorithm,  we  add  an  internal  variable  to  the  set  of  variables  U  =  (v,  <t,  t,  7)^  describing 
the  material  states.  This  variable  indicates  if  a  given  material  point  is  undergoing  an  elastic 
or  plastic  deformation.  The  internal  variable  is  allowed  to  change  during  the  course  of  a  time 
update,  thus  avoiding  the  calculation  of  unphysical  values  of  stress  which  are  beyond  the  yield 
surface  (<7^  -f  >  7^). 

3.2  Stress  evolution. 

When  the  material  is  deforming  plastically,  system  (1.1),  (1.5)  is  not  in  conservation  form. 
Although  the  momentum  equation  (1.1a)  is  a  conservation  law,  the  equation  (1.1b)  describing 
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the  stress  evolution  during  plastic  deformation  is  not  in  conservation  form.  In  order  to  perform  a 
time  update  at  a  given  point,  we  use  a  numerical  scheme  (following  the  ideas  of  Trangenstein  and 
Colella  [7])  which  combines  two  different  algorithms:  (i)  a  second  order  (Godunov)  hyperbolic 
scheme  for  the  momentum  equation  2md  (ii)  a  second  order  implicit  ordinary  differential  equations 
integrator,  for  the  stress  equation  along  a  particle  path. 

The  Godunov  scheme  for  the  momentum  equation  is  ; 


n+l  n  I  2^^/ 


(3.25) 


Here  is  approximated  by  writing  the  equations  of  motion  in  quasi- linear  form  and  tracing 
information  along  characteristics.  The  algorithm  used  on  the  stress  equations  is  described  as 
follows: 


Let  S  be 


integration  of 


Then  is  the  solution  at  <  =  after  the  numerical 


(3.26) 


n+j  n+A 

U  ,  1  —  V.  1 

’  Ax-' 


with  initial  data  5(f„)  =  I  r"  I  .  Equation  (3.26)  and  in  particular  G(S)  is  derived  from  (l.lb,c) 

\  7”  / 

n+i  n+i 

V  TV+i  n+- 

and  (1.5),  replacing  dxV  with  -*■  In  both  algorithms,  and  are  the  values  of  cr 

and  V  at  the  cell  boundaries  evaluated  at  the  intermediate  time  t  =  tn  +  At/2.  These  quantities 
are  computed  using  a  characteristic  tracing  algorithm  to  achieve  second  order  accuracy. 


3.3  Loss  of  hyperbolicity. 

As  was  noted  earlier,  there  is  a  critical  value  of  yield  stress  7,  beyond  which  the  system  is 
unstable.  This  instability  relates  to  the  loss  of  hyperbolicity  in  the  full  two  dimensional  system 
for  signals  traveling  in  certain  directions.  Ph>  sically,  this  loss  of  hyperbolicity  is  associated  with 
the  formation  of  shear  bands. 

In  our  numerical  algorithm,  after  the  time  update,  each  cell  is  tested  to  see  if  the  state  on 
the  cell  is  inside  or  outside  the  region  of  hyperbolicity.  If  the  state  is  outside  this  region,  a  shear 
band  is  then  created  in  the  cell.  From  this  instant,  the  band  is  treated  as  an  internal  boundary 
with  its  own  equations  (1.8)  governing  the  evolution  of  the  stress  in  the  interior  of  the  band. 

The  numerical  algorithm  first  performs  the  update  of  the  states  on  all  the  cells  as  if  shear 
bands  are  not  present.  This  includes  the  computation  of  stress  and  velocity  at  the  cell  boundaries. 
Next  it  corrects  the  states  on  cells  where  shear  bands  are  present.  This  is  done  by  first  integrating 
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an  ordinary  differential  equation  similar  to  that  of  (3.26) 


G(56),  with  initial  data  = 


rj: 

lb 


Sbitn) 


(3.27) 


where  S  is  the  physical  width  of  the  band,  Sb  is  the  stress  vector  inside  the  shear  band,  and 
and  are  the  velocities  on  the  right  and  left  sides  of  the  band  at  time  <  =  <n. 

Shear 


band 

r 

r 

t=tn+At 

I  .  .  .  V.  .  .  .  . 

. . 

. — 4 - 

(. 

L 

U",. 

k 

ur  :tl 

» 

UV'  t=t„ 

Figure  3:  Cell  with  a  shear  band. 

As  a  final  step,  we  update  the  states  Ul  and  Ur  near  the  band;  this  is,  on  the  subcells  created 
by  the  formation  of  a  shear  band  inside  a  cell  (see  fig  3). 

The  states  at  the  band  are  used  to  compute  the  fluxes  ^  ^  ^  at  the  fictitious  boundau^y  (i.e., 

shear  band).  The  states  Ul  and  Ur  are  assigned  to  cells  of  smaller  size  (d/  and  dr)  than  a  regular 
cell.  The  possible  violation  of  the  CFL  condition  is  solved  by  redistributing  the  “numerical  mass” 
into  the  neaxby  cells.  The  result  of  this  mciss  redistribution  is  stored  as  a  modification  on  the 
fluxes  at  the  cell  boundaries  next  to  the  shear  band. 

3.4  Numerical  example 

In  our  numerical  example,  we  use  a  modification  of  the  adaptive  mesh  refinement  algorithm 
as  in  [8].  The  computations  for  the  following  example  are  performed  with  three  levels  of  mesh 
refinement,  each  a  factor  of  three  finer  than  the  previous.  This  algorithm  uses  the  flux  informa¬ 
tion  in  order  to  assure  that,  during  the  mesh  refinement  process,  the  quantities  that  should  be 
conserved  are  actually  conserved.  Thus  the  emphasis  on  expressing  the  mass  redistribution  in 
terms  of  the  fluxes. 

In  figures  4-5  we  show  the  profiles  of  the  solution  for  a  numerical  example.  We  study  an 
initial  value  problem  in  the  interval  0  <  x  <  1,  with  data 
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V  =  —70,  a  =  0.6,  T  =  0,  7  =  0.75  for  x  <  0.5 

V  =  70,  <7  =  0.6,  T  =  0,  7  =  0.75  for  x  >  0.5 
and  with  material  parameters: 

a  =  ir/6,  c  =  10^  and  ^(7)  =  1.7(1  -  7). 

This  problem  does  not  admit  a  selfsimilar  solution  and  a  shear  band  is  expected  to  form  at 
X  =  0.5  as  a  result  of  the  strong  loading  from  both  sides  of  the  initial  discontinuity. 

Figure  4  shows  the  solution  at  a  time  soon  after  the  formation  of  the  shear  band  and  Figure 
5  shows  the  solution  at  a  later  time.  The  plots  correspond  to  the  stress  functions  <7  and  7  (<t  <  7 
always).  We  observe  the  expected  behavior  of  the  stress  at  the  shear  band,  as  predicted  by 
the  analysis  above  (also  see  [5]  and  [2]).  A  precursor  elastic  wave  travels  ahead  of  a  loading 
rarefaction  wave  which  is  followed  by  an  unloading  relief  front.  Note  the  coarsening  of  the  grid  in 
regions  of  neutral  loading  (behind  the  elastic  precursor  wave)  and  in  regions  of  elastic  unloading 
(behind  the  relief  front). 

The  location  of  the  shear  band  is  readily  identified  by  the  dip  in  the  values  of  a.  The 
nonlinearity  of  the  yield  condition  ensures  that  the  stress  at  the  shear  band  will  converge  to  a 
rest  point  under  continuous  loading. 


Figure  4:  a  and  7  at  small  time. 


4.  Conclusions.  In  this  paper,  we  have  summarized  results  concerning  solutions  of  the  Riemann 
problem  for  model  equations  describing  the  deformation  of  granular  materials.  The  model  allows 
for  deformations  with  shear  bands.  The  analytic  results  on  existence  of  a  solution  depend  upon  a 
simplification  of  the  equations,  specifically  a  linearization  of  the  yield  condition  about  the  value 
of  stress  at  which  a  shear  band  forms.  We  outline  the  main  features  of  a  numerical  method  that 
is  based  on  a  higher  order  Godunov  method,  and  which  includes  front  tracking  and  adaptive 
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Figure  5:  a  and  7  at  large  time. 

mesh  refinement.  This  combination  of  techniques  gives  sharp  resolution  of  the  shear  band,  and 
accommodates  large  elastic  wave  speeds.  The  way  adaptive  mesh  refinement  coarsens  and  refines 
the  grid  in  various  regions  of  the  material  can  be  observed  in  Figures  4  and  5.  The  numerical 
adgorithm  performs  well  in  capturing  the  features  of  the  solution  predicted  by  the  theory. 

Many  of  the  features  of  the  algorithm  can  be  extended  to  two  space  dimensions,  although 
this  is  somewhat  complicated.  The  main  difficulty  in  designing  a  code  in  two  dimensions  is  that 
of  capturing  the  growth  of  the  shear  band  as  the  tip  of  the  shear  band  propagates  across  the 
material.  The  algorithm  in  two  dimensions  is  at  an  advanced  stage  of  development. 
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ABSTRACT.  Where  the  usual  Euclidean  algorithm  finds  a  greatest  commcm 
divisor,  we  describe  a  compositional  Euclidean  algorithm  which  would  give  a  largest 
functional  composition  factor.  And  where  the  usual  Euclidean  algorithm  is  based  on 
repeated  division  with  remainder,  the  compositional  Euclidean  algorithm  is  based  upon 
repeated  compositional  division  with  remainder.  The  challenge  is  that  while  we  propose 
an  apparently  reasonable  notion  of  compositional  division  with  remainder,  we  do  not 
know  if  it  exists.  However,  it  is  easy  to  give  a  number  of  consequences,  assuming  that  it 
does  exist. 

INTRODUCTION.  The  Euclidean  Algorithm  is  a  fundamental  algorithm  for 
manipulating  integers  and  polynomials  in  one  variable.  For  both  integers  and 
polynomials,  the  Euclidean  algorithm  comes  down  to  suitably  repeated  division  with 
remainder.  In  both  settings  the  Euclidean  algorithm  produces  a  greatest  common  divisor 
(GCD)  of  two  initial  elements.  In  ideal  theoretic  terms,  the  Euclidean  algorithm  produces 
an  element  which  generates  the  same  ideal  as  the  ideal  generated  by  the  two  initial 
elements.  For  an  ideal  with  a  given  finite  number  of  generators,  apply  the  Euclidean 
algorithm  repeatedly  to  produce  a  single  element  which  generates  the  same  ideal. 

The  rational  function  field  in  one  variable  -  denoted  k(X)  -  consists  of  quotients 
of  polynomials  in  the  variable  X  with  the  usual  arithmetic  identities  among  firactions. 

The  polynomials  have  coefficients  in  a  field  k  which  we  refer  to  as  the  base  field.  k(X) 
has  many  proper  subfields  containing  k,  for  example,  consider  all  fractions  where  the 
numerator  and  denominator  are  polynomials  with  only  even  degree  terms.  This  subfield 
is  generated  as  a  subfield,  not  as  an  ideal,  by  X^  and  is  naturally  denoted  k(X^). 

Luroth’s  theorem  [1,  p.522]  says  that  if  B  is  a  subfield  of  k(X)  containing  k  then  B  is 
generated  over  k  by  a  single  element;  i.e.  B  =  k(b)  for  some  suitable  element  b.  This  is 
the  starting  point  of  the  present  paper.  It  is  natural  to  consider  the  possibility  of  an 
algorithm  A  in  the  spirit  of  the  Euclidean  algorithm  which  does  the  following:  given  two 
rational  functions  b  and  c,  the  algorithm  produces  d  where  k(b,c)  =  k(d) .  We  wish  to 
emphasize  that  we  are  not  trying  to  produce  the  first  algorithmic  solution  to  this  problem, 
rather  we  are  trying  to  stimulate  new  ways  of  working  with  rational  functions.  For  A  to 
be  in  the  spirit  of  the  Euclidean  algorithm  it  must  be  based  on  an  algorithm  B  which  plays 
the  role  that  division  with  remainder  plays  in  the  Euclidean  algorithm. 

Before  presenting  greater  detail  about  the  possible  form  of  A  and  B,  here  is 
another  view  of  the  problem.  In  this  view  functional  composition  is  the  analog  to  product 
of  integers  or  polynomials.  Of  course  product  is  commutative  and  composition  is  not. 
That  is  why  in  a  composite  such  as  f(g(X))  we  refer  to  f  as  the  outer  compositional 
element  and  g  as  the  inner  compositional  element  (ICE).  The  reason  to  switch  from 
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product  to  composite  results  from  the  difference  between  ideal  generation  and  subfield 
generation.  The  subfield  generated  by  d  consists  of  rational  functions  which  can  be 
expressed  f(d)/g(d)  where  f  and  g  are  polynomials  in  one  variable,  f/g  is  a  rational 
function  r  in  one  variable.  Hence  the  subfield  consists  of  all  rational  functions  which 
have  d  as  ICE  in  some  rational  function  decomposition.  This  is  the  analog  to  an  ideal 
generated  by  a  single  element.  The  subfield  generated  by  b  and  c  consists  of  rational 
functions  which  can  be  expressed  F(b,c)/G(b.c)  where  F  and  G  are  polynomials  in  two 
variables.  F/G  is  a  rational  function  R  in  two  variables.  If  k(b,c)  =  k(d)  it  follows  that 
there  are  rational  functions  r^  and  r2  of  one  variable  and  a  rational  function  R  of  two 
variables  where; 

1  b  =  rj(d)  c  =  r2(d)  d  =  R(b,c) 

The  frrst  two  equations  show  that  d  ICE’s  both  b  and  c,  meaning  both  b  and  c  have  d  as 
an  ICE.  The  third  equation  implies  that  any  ICE  of  b  and  c  also  ICE’s  d.  I.e.  if: 

2  b  =  Sj(e)  and  c  =  S2(e)  then  d  =  R(Sj(e),S2(e))  =  R(sj,S2)(e) 

Thus  d  is  a  largest  ICE  (LICE)  of  b  and  c  where  LICE  has  roughly  the  defining  property 
with  respect  to  composition  as  GCD  has  with  respect  to  product.  In  fact,  (1)  consists  of 
the  main  three  equations  for  GCD  with  composition  replacing  product. 

SECTION  ONE.  Now  let  us  work  toward  the  possible  form  of  B,  the 
replacement  for  division  with  remainder.  In  both  integer  and  polynomial  division  the 
remainder  is  smaller  than  the  divisor.  With  integers  smaller  is  with  respect  to  magnitude. 
With  polynomials  smaller  is  with  respect  to  degree.  In  order  to  give  a  compositional 
analogue  to  division  with  remainder,  we  require  a  suitable  notion  of  smaller.  Suppose  a 
is  a  non-zero  integer.  The  magnitude  of  a  equals  the  cardinality  of  the  set:  the  integers 
modulo  the  ideal  generated  by  a.  If  a(X)  is  a  non-zero  polynomial,  the  degree  of  a  equals 
the  dimension  as  a  vector  space  over  k  of  k[X]  modulo  the  ideal  generated  by  a(X).  In 
both  cases  the  size  of  a  is  determined  by  a  relative  measure  of  the  ideal  generated  by  a  to 
the  entire  ring.  We  use  such  a  measure  for  rational  functions.  If  a(X)  is  a  non-constant 
rational  function,  the  size  of  a  is  defined  to  be  the  dimension  of  k(X)  as  a  vector  space 
over  the  subfield  k(a(X));  i.e.  [k(X);k(a(X))].  This  integer  is  called  the  ice  degree  of 
a(X).  By  [1,  p.520,  thm  8.38]  if  a(X)  is  written  in  the  form  f(X)/g(X)  with  f  and  g 
relatively  prime  polynomials,  the  maximum  of  the  usual  polynomial  degrees  of  f  and  g 
equals  [k(X):k(a(X))].  Hence,  the  ice  degree  is  also  determined  directly  from  a(X)  and 
is  not  only  a  relative  concept. 

If  a  is  zero  the  cardinality  of  the  integers  modulo  the  ideal  generated  by  a  is 
infinity.  Same  for  dimension  over  k  of  k[X]  modulo  the  ideal  generated  by  a(X)  when 
a(X)  is  the  zero  polynomial.  The  magnitude  or  degree  is  in  some  sense  opposite  to  the 
size  of  the  quotient  in  these  exceptional  cases.  Ice  degree  exhibits  the  same  anomaly. 
When  a(X)  is  a  constant  rational  function,  k(a(X))  =  k  so  that  [k(X):k(a(X))]  is  irrfinity. 
The  ice  degree  of  a  constant  rational  function  is  zero  by  convention,  which  accords  with 
the  maximum  of  the  usual  polynomial  degrees  of  the  numerator  and  denominator. 

The  defining  formula  for  division  with  remainder,  dividing  a  by  p,  is;  a  = 
r  ♦  P  +  p  ,  where  F  is  the  quotient  and  p  the  remainder.  A  naive  compositional  analog  to 
division  with  remainder  is  given  by:  a  =  f2(P)  +  p  .  We  have  replaced  the  product  by 
composition,  but  the  sum  "+"  and  general  shape  of  the  equation  is  unchanged.  Let  us 
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take  a  slightly  more  sophisticated  approach.  Division  with  remainder  is  effective  in  the 
realm  of  ideid  theory.  The  reason  is  because  an  equation  of  the  form:  is 

universally,  arithmetically  reversible  to  give  p  =  a  -  F  *  a .  Consequently  the  ideal 
generated  by  a  and  P  equals  the  ideal  generated  by  P  and  p.  This  is  the  key  property  upon 
which  the  Euclidean  algorithm  is  based  and  which  we  must  preserve.  In  other  words,  if 
compositional  division  of  a  by  P  is  to  produce  an  ice  remainder  p,  the  process  ought  to  be 
universally,  compositionally  reversible  so  that  k(a,P)  =  k(P,p).  Once  we  achieve  this,  the 
compositional  Euclidean  algorithm  follows.  Let  us  play  with  this  condition:  k(a,P) » 
k(p,p).  Collecting  a  common  k(P)  we  are  asking:  k(P)(a)  =  k(P)(p) .  Fractional  linear 
transformations  are  automorphisms  of  function  fields.  It  is  natural  to  seek  a,b,c,d  in  k(P) 
where: 

3  ad  -  be  is  non-zero  and  the  ice  degree  of  ( aa  -t-  b )  /  ( ca  +  d) 
is  less  than  the  ice  degree  of  p.  Let  p  denote  ( aa  +  b )  /  ( ca  +  d) . 

The  non-zero  (determinant)  condition  on  ad  -  be  insures  that 

4  ( ep  +  f )  /  ( gp  h)  =  a  for  suitable  e,f,g,h  in  k(P) ;  hence  k(P)(a)  =  k(P)(p) 

DEFINmON  Given  rational  functions  a  and  P  where  p  has  ice  degree  at  least  one,  the 
compositional  division  of  a  by  p  consists  of  a,b,c,d  in  k(P)  satisfying  (3).  For  such  a 
compositional  division,  p  is  the  ice  remainder.  The  fractional  linear  transformation  and 
rational  function:  (aX  +  b)/(cX  +  d)  is  the  outer  compositional  quotient  of  a  by  p. 

Here  are  three  questions  concerning  compositional  division,  the  first  of  which  is: 
is  compositional  division  always  possible?  For  example,  suppose  P  has  ice  degree  at 
least  one  and  a  does  not  lie  i.:  k(p).  In  this  case  k(P)(a)  properly  contains  k(P).  By 
Luroth’s  theorem,  k(P)(a)  is  generated  by  a  single  element  p.  Since  k(p)  =  k(P)(a)  which 
properly  contains  k(P),  it  follows  that  p  has  lower  ice  degree  than  p.  It  is  clear  that  p  can 
be  written  as  a  polynomial  in  a  with  coefficients  from  k(p)  but  can  p,  or  some  other 
element  of  k(P)(a)  with  lower  ice  degree  than  P,  be  written  as  a  fractional  linear 
transformation  of  a  with  coefficients  from  k(p)?  The  next  two  questions  assume  that 
compositional  division  is  possible.  In  this  case:  a.  give  a  beautiful  algorithm  for 
compositional  division,  or  at  least  for  finding  p;  b.  give  an  efficient  algorithm  for 
compositional  division,  or  at  least  for  finding  p. 

We  conclude  by  deriving  consequences  of  compositional  division,  including  the 
compositional  Euclidean  algorithm. 

LE]!|(IMA  Given  rational  functions  a  and  p  where  P  has  ice  degree  at  least  one,  the 
corripositional  division  of  a  by  p  exists  and  has  ice  remainder  in  k  if  and  only  if  a  lies  in 
k(P).  In  this  case  any  element  of  k  can  be  achieved  as  an  ice  remainder. 

PROOF  Say  a  lies  in  k(P)  and  s  is  any  element  of  k.  In  (3)  let  a  =  1,  b  =  a(s  - 1),  c  =  0 
and  d  =  a.  Then  (3)  is  satisfied  and  p  =  s.  Conversely  suppose  the  compositional 
division  exists  with  ice  remainder  p  in  k.  From  (4)  and  that  p  lies  in  k  it  follows  that  a 
lies  in  k(p).  qed 

Let  us  now  assume  that  compositional  division  does  in  fact  exist  and  show  how  to  utilize 
it  in  the  manner  that  ordinary  division  is  utilized. 
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PROPOSITION  Let  K  be  a  subfield  of  k(X)  which  properly  contains  k.  Then  K 
contains  elements  of  non-zero  ice  degree.  If  P  is  an  element  of  K  with  smallest  non-zero 
ice  (tegree  then  K  =  k(P). 

PROOF  Any  element  of  K  lying  outside  of  k  has  non-zero  ice  degree.  Let  p  be  any  such 
element  with  minimal  ice  degree.  Let  a  be  any  element  of  K.  p  the  ice  remaiiMler  ^m 
compositional  division  of  a  by  P  has  smaller  ice  degree  than  p.  By  the  minimality 
property  of  P,  it  follows  that  p  has  ice  degree  zero  and  hence  lies  in  k.  By  the  previous 
lemma  it  follows  that  a  lies  in  k(P).  qed 

COMPOSITIONAL  EUCLIDEAN  ALGORITHM  Staning  with  rational  functions  Oq 
and  construct  the  sequence  aQ,aj,a2,...>aj^  where  is  the  ice  remainder  of 
compositional  division  of  process  continues  until  reaching  of  ice 

degree  zero.  We  allow  the  possibility  that  n  =  1  and  no  division  is  performed.  Then 
is  the  ICE  of  Oq  and  . 

PROOF  At  each  stage  k(aQ,a|)  =  k(aj  ,02)  = ...  =  k(aj^_j,ajj)  =  kfotj^.j)  where  the  last 
duality  follows  since  lies  in  k.  qed 
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Abstract 

In  this  iM|Mr,  we  define  Hyperbolic  Cascaded  Heed-Solomon  (HCRS)  codes,  study  thw  al¬ 
gebraic  prop^ies,  and  describe  an  algorithm  for  decoding  thmn  up  to  th«r  fuU  error-correcting 
Ci4>abibty.  The  codewords  of  HCRS  codes  are  represented  either  as  arrays  or  as  bivariate  poly¬ 
nomials.  Our  decoding  algorithm  is  an  extension  of  an  algorithm  of  Sakata,  and  the  decoding 
is  performed  1^  calculating  a  Grobner  basis  for  an  ideal  related  to  the  error  locations. 

Section  1.  Linear  Block  Codes 

We  begin  with  a  brief  introduction  to  the  theory  error-correcting  codes,  and  in 
particular,  linear  block  codes  [1,  6]. 

Error-Correcting  Codes.  Error-correcting  codes  are  used  to  protect  digital  in¬ 
formation  during  transmission  across  a  noisy  channel.  Applications  include  modem  com¬ 
munications  over  a  phone  line,  radio  commtmications  with  satellites  ot  spacecraft,  and 
computer  storage  devices.  In  each  case,  noise  is  introduced  to  the  data  transmitted  so  that 
the  bits  received  may  not  be  the  same  as  the  bits  transmitted.  An  error-correcting  code 
adds  redundancy  to  the  information  to  create  codewords  which  may  be  reconstructed  by 
the  receiver  even  if  some  of  the  bits  are  in  error. 

Linear  Block  Codes.  Let  9  be  a  power  of  a  prime  number.  We  use  the  elements 
of  F,,  the  finite  field  with  q  elements,  asus  the  symbols  of  an  alphabet  used  to  compose 
codewords.  Usually,  9  is  a  power  of  2  so  that  sjrmbob  can  be  expressed  as  strings  of  bits. 
The  encoder  takes  a  word  of  length  k  and  encodes  it  as  a  codeword  of  length  n>  k.  The 
encoder  thus  is  a  one-to-one  map  FJ  — ►  F^.  The  code  C  is  the  set  of  all  codewords;  that 

is,  the  image  of  F*  under  the  encoding  map.  We  require  the  encoder  to  be  a  linear  ma,^  of 
vector  spaces  over  F^,  so  that  C  is  a  linear  block  code.  In  this  case,  C  is  a  k-dimensional 
subspace  of  F^ . 

Hamming  Distance.  The  Hamming  weight  ||u7||h  of  a  word  u;  €  F”  is  the  number 
of  nonzero  entries  of  w.  The  Hamming  distance  dff(v,  w)  between  two  elements  v,  u;  €  F^ 
is  defined  to  be  j|t;  —  that  is,  the  number  of  entries  in  which  they  disagree.  The 

minimum  distance  of  a  code  C  is  the  TniniTnnm  Hamming  distance  between  any  two  distinct 
codewords  of  C. 

Correction  of  Errors.  Suppose  C  is  a  linear  block  code  with  minimum  distance 
at  least  2t  -f-  1.  A  codeword  c  €  C  is  sent  through  the  channel  and  some  of  its  entries 
are  sJtered.  Thus  the  receiver  receives  a  word  w  G  F^  which  is  the  sum  c  -t-  c  of  the 
codeword  and  an  error  word  c  €  FJ.  The  number  of  places  in  which  c  has  been  corrupted 

This  work  was  supported  by  the  U.S.  Army  Research  Office  through  the  Army  Center  of  Excellence 
for  Symbolic  Methods  in  Algorithmic  Mathematics  (ACSyAM),  Mathematical  Sciences  Institute  of  Cornell 
University.  Contract  DAAL03-92-G-0126. 
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is  expressed  as  the  Hamming  distance  dH{va,c)  =  |)e{j/f.  Assuming  that  no  more  than  t 
errors  occurred,  we  have  dfjiw^c)  <  t,  and  c  is  the  unique  codeword  with  this  property 
since  by  the  triangle  inequality,  dnitOycf)  >  <  + 1  few  all  other  codewords  c* . 

Thus  a  code  C  with  TniniTninn  distance  2t  +  1  is  a  i-ctror  correcting  code,  that  is, 
any  pattern  of  up  to  t  errors  may  be  corrected.  The  task  of  the  decoder  is  to  produce 
the  codeword  nearest  in  Hamming  distance  to  the  received  word.  However,  the  numbor 
of  codewords  is  exponential  in  the  blocklength,  as  well  as  the  number  of  correctable  error 
patterns,  so  finding  an  efficient  decoding  algorithm  is  problematic. 

Parameters  of  a  Linear  Block  Code.  An  (ra,  k,  d)  linear  block  code  C  is  a  Ar- 
dimensional  subspace  of  with  minimnm  distance  d.  The  parameter  n  is  called  the 
blocklength  of  the  code.  The  parameter  k  is  called  the  dimension  of  the  code.  The  rate 
of  the  code  is  the  ratio  k/n,  since  each  word  of  k  symbols  is  encoded  as  a  word  of  n 
symbols.  The  parameter  d  is  the  minimum  distance  of  the  code.  We  have  seen  that  a 
code  with  minimum  distance  d  can  correct  up  to  <  =  errors.  In  choosing  a  code 

for  transmission  of  data  across  a  ^ven  channd,  there  must  be  some  tradeoff  between  the 
conflicting  goals  of  high  rate  and  high  error-correcting  capability.  We  must  also  keep  in 
mind  that  the  code  ought  to  have  an  efficient  decoding  algorithm. 

Section  2.  Hyperbolic  Cascaded  Reed-Solomon  Codes 

Hyperbolic  Cascaded  Reed-Solomon  (HCRS)  codes  have  been  studied  in  [4,  5,  7,  10]. 
We  give  an  algebraic  description  of  HCRS  codes  here,  but  they  were  originally  introduced 
in  [10]  using  the  cascade  code  construction  of  Blokh  and  Zyablov  [2].  HCRS  codes  are 
in  many  ways  a  genet alization  of  the  widely-used  Reed-Solomon  (RS)  codes  [1,  6].  One 
motivation  for  using  HCRS  codes  is  their  long  blocklengths:  whereas  RS  codes  over  the 
alphabet  F,  are  limited  to  blocklengths  q,  HCRS  codes  have  blocklengths  fa  q^. 

Notation.  The  set  of  nonzero  elements  of  F,  forms  a  cyclic  group  under  multipli¬ 
cation.  Let  n  =  g  —  1  be  the  order  of  the  cyclic  group  F*  =  F,  \  {0},  and  choose  o  €  F, 
to  be  a  generator  of  this  group.  Thus  F*  =  {l,a,a*, •  •  •  Let  F^^"  be  the  set 

of  n  X  n  arrays  with  entries  from  the  the  field  F,,  and  let  F”’‘”[x,y]  denote  the  set  of 
bivariate  polynomials  /  €  F,[x,y]  with  deg^/  <  n  and  deg^/  <  n.  We  identify  F^^^" 
with  F”’^"[x,y]  by  identifying  a  polynomial  a(x,  y)  with  the  array  a  of  its  coefficients: 

aoo 

•  • 

Qn-l.O  ••• 


ao,n-i 

®n— l,n— 1 


n— 1  n— 1 

v)  = 

(=0  >=0 

The  Fourier  lYansform.  The  Fourier  transform  is  a  one-to-one  linear  map  from 
F"’^"[x,  y]  (the  time  domain)  to  F"^"[A’,  Y]  (the  frequency  domain).  (Of  course,  these 
two  spaces  are  isomorphic,  but  we  express  the  polynomials  in  different  sets  of  variables  in 
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wder  to  distmguish  them.)  The  Fourier  transfcmn,  a  *-*  A,  and  its  inverse  are  given  by 
the  fbnnulas: 

A{X,  =  a>)JC‘K>, 

tsO  jsO 


y)=i^5]]A(a  a  ^)x‘y^ 

”  i=o  >=o 

Note  that  Aij  =  a(a',  a^),  so  is  a  table  of  the  values  of  a(x,  y).  The  properties  of  the 
Fourier  transform  are  studied  in  detail  in  [1]. 

2-D  Cyclic  Codes  [7,  9].  Let  P  C  be  a  set  of  “frequencies”.  A  2-D  cyclic  code 
is  defined  by  taking  as  codewords  those  words  a  €  FJ  whose  Fourier  transform  has  zero 
entries  in  the  positions  specified  by  P. 


Cp  =  {a  €  F"^**  :  Aij  =  0  for  each  (i,;)  €  P}. 

The  blocklength  of  this  code  is  N  —  n*,  since  it  is  a  snbspace  of 

Hyperbolic  Cascaded  Reed-Solomon  Codes.  For  arbitrary  choices  of  the  coo- 
straint  set  P,  the  minimum  distance  of  the  2-D  cyclic  code  Cp  may  not  be  very  good  (and 
difficult  to  determine).  We  define  Hyperbolic  Cascaded  Reed-Solomon  codes  which  can 
be  shown  to  have  good  distance  properties.  Given  a  parameter  d,  the  desired  minimum 
distance,  we  define  the  frequency  set 

Pd  =  {(*,»  €  :  (i  -I-  l)(j  -I- 1)  <  d,  »  <  n,  j  <  n}  . 

Then  we  define  the  Hyperbolic  Cascaded  Reed-Solomon  code 

HCRSd  =  Cpj  =  {a  €  :  Aij  =  0  whenever  (i  -|-  l)(j  +  1)  <  d}. 

The  parameters  of  the  code  HCRSrf  are  {N  =  n^,  k  =  —  |Prf|,  d).  The  dimension 

is  —  \Pd\  since  each  constraint  imposed  by  an  element  of  Pd  is  independent,  and  the 
minimum  distance  can  be  shown  to  be  at  least  d,  the  designed  minimum  distance  [7]. 

Examples  of  HCRS  codes.  These  ideas  are  best  understood  by  looking  at  an 
example.  Consider  the  (49,  35,  7)  code  HCRS?  over  Fg.  The  frequency  set  is  given  by 

P,  =  {(i,  j)  e  Zi  :  (i +  1)0+1)  <7} 


=  {(0,0),(0,1).(0,2),(0,3),(0,4),(0,5),(1,0),(1,1),(1,2),(2,0),(2,1),(3,0),(4,0),(5,0)) 


35 


A  7  X  7  array  (a,j)  is  a  codeword  if  and  only  if  its  transform  Aij  has  the  form: 
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codeword  a  (time  domain)  its  transform  A  (frequency  domain) 


This  diagram  also  suggests  a  method  of  encoding:  a  message  consisting  of  35  s}rmbols 
could  be  entered  as  the  35  free  entries  of  the  transform  array  A,  and  the  inverse  transform 
could  then  be  applied  to  obtain  a  codeword.  Thus  we  have  a  linear  map  Fg*  — *■  Fg®  whose 
image  is  HCRS7. 

HCRS  codes  in  m  dimensions.  Consider  m-dimensional  n  x  n  x  •  •  •  x  n  arrays  over 
F,.  Use  the  m-dimensional  Fourier  transform  to  define  codes  with  blocklength  iV  =  n"*. 
The  one-dimensional  version  of  a  HCRS  code  is  the  well-known  Reed-Solonxjn  code.  For 
example: 

Reed-Solomon:  RS<j  =  {a  €  F^  :  A,  =0,  (i  -f  1)  <  d} 

2- DHCRS:  HCRS^  =  {a  €  F^^"  :  =  0,  {i +  1)  {j  +  1)  <  d} 

3- D  HCRS:  HCRS^  =  {aeF”"""^":  A;,*  =  0,  {i  +  \)  {j  +  1)  {k  +  1)  <  d} 

Section  3.  Decoding  HCRS  Codes 

We  have  developed  an  algorithm  for  correcting  HCRS  codes  up  to  their  fuU  error- 
correcting  capacity.  That  is,  our  algorithm  corrects  any  pattern  of  t  errors  for  the  code 
HCRSat+i-  The  algorithm  is  given  in  [7].  We  sketch  here  some  of  the  underlying  ideas. 

The  Syndrome  Array.  From  now  on,  we  will  be  considering  the  code  HCRS21+1. 
A  codeword  c  €  HCRS2f+i  is  sent  through  the  channel.  An  error  e  €  F”^**  may  be 
introduced,  restilting  in  a  received  word  which  is  the  stun  ?i7  =  c  -1-  e.  Recall  that  the  task 
of  the  decoder  is  to  determine  the  imique  codeword  c  which  differs  from  w  in  at  most  t 
entries.  Apply  the  Fourier  transform:  Wij  =  Cij  -Eij,  for  all  (i,j).  The  transform  E  of 
c  is  the  syndrome  array.  Note  that  for  each  (i,j)  €  R2t+i,  an  entry  Eij  of  the  syndrome 
array  is  known  to  the  decoder,  since  C,^=0  and  therefore  Wij  =  Eii.  Thus  the  syndrome 
array  is  partially  known,  and  the  decoding  problem  may  be  reformulated:  the  decoder 
must  find  the  imique  completion  of  the  partially-known  syndrome  array  to  a  full  n  x  n 
array  which  is  the  transform  of  an  error  pattern  with  weight  t  or  less. 
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2-D  Linear  Recursion  Relations.  Extend  the  syndrome  array  ^  to  an  infinite 
array: 

Eij  =  e{a\a^),  for  ail (i, j)  6  Zj.. 

Since  a”  =  1,  this  array  is  doubly  periodic: 

Ei+n,j  —  Eijy  Ei^j^n  —  Eij 

This  is  an  example  of  a  two-dimensional  linear  rectirsion  (2-D  LR)  relation  [8].  In  general, 
if 

Y,  fr.  Ei+r,>+.  =0,  for  da  (i,i)  6  Z*, 

r  a 

we  say  that  E  satisfies  a  i~D  LR  relation  with  coefficients  frs.  The  characteristic  polyno¬ 
mial  of  this  relation  is: 

/(a?.y)  =  Xm  y*- 

r  a 

As  mentioned  above,  the  periodicity  conditions  Ei+n,i  =  Etj  and  Eij+n  =  Eij  are 
examples  of  2-D  LR  relations  which  are  satisfied  by  every  syndrome  array.  These  relations 
have  3?“  —  1  and  y“  —  1  as  their  characteristic  polynomials. 

The  Error  Locator  Ideal.  Consider  the  error  polynomial  c(x,  y)  =  Ylj  ^ij  V^- 
For  an  error  pattern  which  is  correctable,  there  are  at  most  t  coefficients  Cij  which  are 
nonzero.  For  each  nonzero  e,j, 

(i,j)  is  an  error  location 
Cij  is  an  error  value. 

The  following  theorem  connects  the  error  locations  with  the  set  of  2-D  LR  relations  valid 
on  the  syndrome  array. 

Error  Location  Theorem  (Sakata  [9]). 

F(X,  Y)  is  a  2-D  LR  relation  valid  on  E 


F{a*,a^)  =  0  for  each  error  location  (t,i). 

This  motivates  us  to  define  the  error  locator  ideal: 

L  =  {F  €  F,[X,  y] :  F  is  a  valid  2-D  LR  relation  on  E} 

=  {F  €  F,[X,y]  :  F  vanishes  at  each  (a‘,a^)} 

To  identify  the  error  locations,  we  seek  a  Grobner  basis  [3]  which  generates  L: 

L  =  (F,  {X,  y ),  F2ix,  y ),  •  • . ,  Ft{x,  y )). 

(In  the  case  of  Reed-Solomon  codes,  we  sure  considering  an  ideal  in  the  ring  of  polynomials 
in  one  variable.  In  this  case,  the  ideal  will  always  be  principal:  L  =  (A(X)),  and  here 
A(X)  is  the  error-locator  polynomial  as  studied  in  the  theory  of  Reed-Solomon  codes  [1, 
6].) 
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Overview  of  Decoding  Algorithni  [7]. 

1.  Find  2-D  LR  relations  satisfied  by  the  syadrome  array.  (Form  a 
Grobner  basis  for  the  error  locator  ideal  L.)  There  are  two  basic  op¬ 
erations: 

a.  Testing  a  2-D  LR  relation  to  see  if  it  correctly  predicts  the  value 
of  a  known  syndrome. 

b.  Predicting  the  value  of  an  unknown  syndrome  using  a  2-D  LR 
relation. 

2.  Find  the  common  zeros  of  the  polynomials  in  L.  These  are  the  com¬ 
mon  zeros  of  the  polynomials  in  the  Grobner  basis.  Each  zero  {a*,a^) 
identifies  an  erroi  location  (i,j). 

3.  Interpolate  lo  find  the  error  values.  Subtract  the  error  from  the  re¬ 
ceived  word  to  obtain  the  codeword. 


Example.  We  again  consider  the  (49,  35,  7)  code  over  Fg.  This  code  is  capable  of 
correcting  any  pattern  of  three  errors  or  less.  Choose  a  (the  primitive  7***  root  of  unity)  to 
be  a  solution  of  the  equation  -I-  a  -1- 1  =0.  We  receive  the  following  word,  and  calculate 
the  corresponding  syndrome  array  (the  symbol  denotes  an  tmknown  entry): 
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Received  Word  Syndrome 

The  error  locator  ideal  is  fotmd  to  be 


where 


L  =  (/l,/2,/3), 


/i  =  Y^+a^Y  +  a^ 
f2  =  XY  +  aX  +  a^Y  +  a^ 
h  =  X'^-\-X-^a^Y  +  a^ 


The  corresponding  relations  which  are  satisfied  by  the  syndrome  array  E  are: 


Ei,j+2  +  4-  oi^Eij  =  0 

Ei+i,j+i  +  aEi+ij  -I-  Eij+i  -I-  Eij  =  0 

Ei+2,j  +  Ei+ij  +  Eij+i  4-  Eij  =  0 

We  solve  for  the  three  common  roots  of  the  polynomials  /i ,  /a,  /$: 


(0,0),  (Q*,a),  (o^a^). 
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This  indicates  that  the  error  locations  are  (3,1)  (5,4).  Thus  c(x,y)  =  enxy  + 

e3i**y  +  We  recalculate  the  syndromes  in  terms  of  this  expression  to  solve  for 

the  error  values: 

cn  =  <*31  =  Ij  C54  =  1- 

These  three  values  are  subtracted  from  the  corresponding  entries  of  the  received  word  to 
obtain  the  codeword: 
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Abstract 

This  paper  presents  a  method  for  extracting  programs  from  classical  sequent  proofs  in  propositional 
logic.  The  term  language  is  typed  lambda  calculus  augmented  by  a  nonlocal  control  operator.  The 
control  operator  used  in  this  paper  is  Scheme’s  call/cc  (call-with-current-continuaticm).  An  advantage  of 
using  eall/cc  is  that  a  subject  reduction  theorem  can  be  proved  very  easily  for  this  language. 

One  new  feature  of  this  work  is  the  choice  of  the  classical  sequent  calculus  for  propositional  logic 
as  the  proof  system.  Terms  are  extracted  directly  from  propositional  sequent  proofs,  rather  than  via  a 
translation  to  natural  deduction  proofs.  A  technique  for  representing  continuations  as  partial  proofs  is 
devdoped,  and  reduction  is  described  as  an  operation  ea  psoofo. 


1  Introduction 

In  recent  years,  there  has  been  a  great  deal  of  interest  in  the  computational  content  of  classical  proofs. 
Since  the  discovery  of  a  classical  typing  for  the  control  operator  C  by  Griffin  [5],  a  great  deal  of  work  has 
been  done  in  expressing  the  computations  in  classical  proofs  as  programs  in  a  lambda  calculus  extended  by 
nonlocal  control  operators.  Using  the  principle  of  the  Curry-Howaxd  isomorphism,  if  we  augment  the  typed 
lambda  calculm  with  an  operator  having  the  type  of  a  classical  axiom  (in  particular,  an  axiom  strong  enough 
to  prove  all  of  classical  logic  when  added  to  intuitionistic  logic),  we  will  be  able  to  extract  programs  from 
classical  proofs.  The  question  is  then  whether  these  programs  actually  represent  a  sensible  computation. 

Murthy  answered  this  question  in  part  in  bis  thesis  [6],  where  he  showed  that  a  classical  proof  of  a  II2 
sentence  0  can  be  interpreted  as  a  program  in  an  extended  lambda  calculus  which  meets  the  specification 
0.  This  work  was  done  considering  various  fixed,  normalizing  evaluation  strategies,  however;  the  question 
of  strong  normalization  for  languages  with  control  operators  remains  open.  Barbanera  and  Berardi  proved 
a  strong  normalization  result  for  one  such  language  in  [1],  with  some  substantial  restrictions  on  the  types 
allowed  in  the  language.  Specifically,  their  language  forbids  types  with  strict  subtypes  of  the  form  and 
forbids  ±  from  appearing  on  the  left  hand  side  of  an  implication. 

This  paper  presents  a  method  for  extracting  programs  from  classical  sequent  prooft  in  propositional  logic. 
The  term  language  is  typed  lambda  calculus  augmented  by  a  nonlocal  control  operator.  The  control  operator 
used  in  this  paper  is  Scheme’s  call/cc  (call-with-current-continuation).  An  advantage  of  using  call/cc  is  that 
a  subject  reduction  theorem  can  be  proved  very  easily  for  this  language. 

One  unusual  aspect  of  this  work  is  the  choice  of  the  classical  sequent  calculus  for  propositional  logic  as  the 
proof  system.  Terms  are  extracted  directly  from  propositional  sequent  proofs,  rather  than  via  a  translation 
to  natural  deduction  proofs.  A  technique  for  representing  continuations  as  partial  proofs  is  developed,  and 
reduction  is  described  as  an  operation  on  proofe.  Using  this  proof  structure  leads  to  a  greater  understanding 
of  reduction,  which  may  in  turn  lead  to  a  strong  normalization  proof  for  these  terms. 

*  Supported  in  part  by  the  United  States  Army  Research  Office  through  the  Army  Center  of  Excellence  for  Symbolic  Methods 
in  Algorithmic  Mathematics  (ACSyAM),  Mathematical  Sciences  Institute  of  Cornell  University.  Contract  DAAL  03-91-C-0027, 
and  in  part  by  a  National  Science  Foundation  Graduate  Fellowship. 


41 


The  work  began  as  an  attempt  by  the  author  to  understand  exactly  how  cut  elimination  as  reduction 
corresponds  to  normalization  of  natural  deduction  proofs,  which  was  inspired  in  part  Girard’s  work  [4] 
on  constructive  interpretations  of  classical  logic,  and  in  part  by  considering  an  extension  of  the  author’s 
previous  work  on  the  tableau  algorithm  for  intuitionistic  propositional  calculus  ([7]). 

The  rest  of  the  paper  describes  the  details  required  for  the  proof.  First,  we  define  term  language  we  use 
to  express  the  computations  involved.  Then,  an  outline  of  the  method  of  extraction  of  a  term  from  a  sequent 
proof  is  given.  Next,  we  extend  the  computation  system  by  the  addition  of  typed  constants,  and  we  define 
the  reduction  rules  for  program  terms  built  from  terms  extracted  from  proofe  and  these  constants.  (These 
rules  are  actually  described  in  the  form  of  a  term  rewriting  machine.)  The  remainder  of  the  paper  describes 
the  correspondence  between  reduction  and  cut  elimination. 

2  Term  Language  for  Classical  Logic 

Each  rule  of  the  classical  sequent  calculus  will  correspond  to  the  construction  of  a  term  in  a  term  language 
based  on  typed  lambda  calculus  plus  call/cc,  a  nonlocal  control  operator.  Classical  typings  for  control 
operators  were  discovered  by  Griffin  and  developed  by  Murthy  ([5,  6]).  In  particular,  call/cc  can  be  given 
the  type  {->P-*P)-*P,  a  classical  tautology  which  is  a  form  of  Peirce’s  law.  The  results  given  here  could 
also  be  expressed  in  a  term  language  which  uses  the  control  operator  C  and  the  abort  operator  A  with  their 
operational  semantics  as  described  in  [3],  at  the  expense  of  increased  complexity  of  the  proofs.  (To  make 
this  translation  use  the  definition 

caU/cc(M)  =  {C\k.k{Mk)) 

as  in  ([3]);  this  definition  also  justifies  the  typing  of  call/cc  if  we  consider  the  usual  typing  of  C  as  -t{->P)-*P.) 

The  basic  term  language,  with  the  exception  of  call/cc,  is  a  subset  of  the  Nuprl  term  language  ([2]).  We 
use  logical  notation  for  types,  and  for  each  type,  we  describe  how  to  build  terms  which  inhabitant  that  type 
from  inhabitants  of  its  constituent  types.  Such  terms  are  called  canonical  because  they  are  not  themselves 
reducible  (though  their  subterms  may  be). 

The  types,  and  some  of  the  terms,  are  defined  as  follows: 

•  Propositional  letters  (p,q,r...)  are  types,  and  their  canonical  inhabitants  are  variables  (x,  y,  z,  ...) 
of  the  appropriate  type. 

•  If  Pi  and  P2  are  types,  then  Pi  A  P2  is  a  type.  If  f  1  and  f2  are  terms  of  type  Pi  and  P2  respectively, 
then  pairCf  1  ,f2)  is  a  canonical  inhabitant  of  Pi  A  P2.. 

•  If  Pi  and  P2  are  types,  then  Pi  V  P2  is  a  type.  If  f  1  and  f  2  terms  of  type  Pi  and  P2  respectively, 

then  inl(fi)  and  inr(f2)  are  canonical  inhabitants  of  Pi  V  P2. 

•  If  Pi  and  P2  are  types,  then  Pi  -►P2  is  a  type,  and  Ax .  f  2  is  a  canonical  inhabitant  of  it  if  x  is  a  variable 
of  type  Pi. 

•  ±  is  a  type,  and  it  has  no  canonical  inhabitants. 

The  type  P— will  be  abbreviated  -iP. 

Also  in  the  language  are  the  following  term  constructors: 

•  If  t  is  a  term  of  type  (-iP)— »P,  then  call/cc (t)  is  a  term  of  type  P. 

•  If  t  is  a  term  of  type  ±,  then  any^(t)  is  a  term  of  type  P.  The  superscript  will  be  omitted  when  it  is 
deducible  from  context. 

•  If  t  is  a  term  of  type  Pi  V  P2,  and  ti  is  a  term  of  type  P  with  free  variable  u  of  type  Pi,  and  t2  is  a 
term  of  type  P  with  free  variable  v  of  type  P2,  then  decideCt;  u.ti ;  v.t2)  is  a  term  of  type  P. 

•  If  t  is  a  term  of  type  Pi  A  P2,  and  tiis  a  term  of  type  P  with  free  variables  u  of  type  Pi  and  v  of  type 
P2,  then  8pread(t;u,v.ti)  is  a  term  of  type  P. 
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•  If  f  is  a  term  of  type  P\-*Pi,  and  t  is  a  term  of  type  Pi,  then  apply (f  :t)  is  a  term  of  type  Pj.  This 
term  will  often  be  abbreviated  f  (t). 

Some  abbreviations  will  be  helpful  in  discussing  multiple  disjunctions,  as  j4i  V . . .  V  An-  We  shall  parse 
this  as  i4i  V  (iis  V  (. . .  (v4„-i  Vv4n))),  and  we  shall  use  ini(z)  as  an  abbreviation  for  the  appropriate  sequence 
of  ini  and  inr  applications  such  that  if  z  :  ^4^,  then  inj(z) :  Ai  V  . . .  V  An-  To  generalize  the  decide  term 
constructor  to  handle  multiple  disjunctious,  assume  we  have  terms  ti . . .  t„  of  type  P  such  that  each  ti  has 
free  variable  u^  of  type  Ai,  and  we  have  t  of  type  j4i  V  . . .  V  An.  Then  the  term  decide„(t;ui  .ti ;  . . .  ; 
Un  .tn)  has  type  P,  and  is  considered  to  be  an  abbreviation  of  the  appropriate  sequence  of  ordinary  decide 
constructions. 

3  Classical  Sequent  Proof  Terms 

For  each  rule  of  the  classical  sequent  calculus,  we  describe  how  the  computational  content  of  the  hy¬ 
pothesis  of  the  sequent  is  transformed  to  computational  content  of  the  conclusion.  In  general,  a  sequent 
....  Am  Bi,. . .  ,Bn  will  represent  the  type  Ai-^{A2-* . . .  (Am-*Bi  V  ...  V  Bn)),  and  so  its  computa¬ 
tional  content  will  be  a  term  of  that  type.  Thus,  we  will  assign  a  variable  to  each  formula  on  the  left  of 
the  turnstile,  and  the  inhabitant  of  the  (implicit)  disjunction  on  the  right  will  be  constructed  from  these 
variables  and  the  term  constructors.  The  computational  content  of  the  sequent,  then,  is  the  closed  A-term 
for  the  function  involved. 

For  all  sequent  rules  but  one,  we  shall  see  that  if  if  we  have  a  constructive  proof  of  each  hypothesis 
sequent,  we  will  also  have  a  constructive  proof  of  the  conclusion  of  the  rule.  This  is  true  despite  the  fact  that 
we  may  have  more  than  one  formula  on  the  right  side  of  the  turnstile.  For,  if  a  sequent  F  I-  A  is  constructively 
provable,  given  proofs  of  the  formulas  in  F  we  will  be  able  to  determine  which  formula  in  A  is  proved.  This 
is  usually  enough  to  allow  us  to  determine  which  formula  on  the  left  of  the  conclusion  sequent  is  proved. 
The  exception  is  the  rule  of  arrow  introduction  on  the  left: 

r,v4l-P,A 

ri-A-^B,A 

Since  we  may  have  satisfied  the  hypothesis  sequent  by  proving  something  in  A,  and  we  may  have  used  the 
assumption  A,  we  cannot  be  certain  that  we  can  still  prove  anything  in  A.  Nor  can  we  prove  A-*B,  since 
we  may  not  have  proved  B  in  the  hypothesis  sequent.  It  is  only  in  the  computational  content  of  this  sequent 
that  the  call/cc  operator  appears. 

Extracting  terms  from  proofs 

The  rest  of  this  section  describes  how  to  construct  a  term  representing  the  computational  content  of  a  sequent 
proof. 

The  computational  content  of  an  axiom 

X  :  yll-x  :  A 

is  the  term  Ax .  z,  the  identity  function.  A  more  general  form  of  this  axiom  is 

•  All . . . ,  tn  :  An  I”  tj  :  Ai 

and  its  computational  content  is  the  term  Azi. . .  Axn  .x<. 

Now  assume  that  we  are  given  functions  representing  the  computational  content  of  the  sequents  which 
are  the  premises  of  each  sequent  rule.  We  use  these  functions  to  construct  the  computational  content  of  the 
conclusion.  The  notation  is  extended  so  that  f  :  FH  A  means  that  if  F  =  Ai, . . . ,  An  and  A  =  Si, ... ,  Bn,  f 
is  a  function  with  type  (Ai— »(A2 . . .  (An— >(Si  V ...  V  Bn))))-  No  ambiguity  will  arise,  since  the  term  is  read 
this  way  only  if  there  is  no  term  explicitly  shown  for  the  formula  on  the  right  side  of  the  turnstile. 

Since  sequent  rules  may  involve  lists  of  formulas  F  =  Ai,...  ,An  which  are  carried  from  hypothesis  to 
conclusion  without  modification,  we  will  often  abbreviate  the  list  gi  :  Ai , . . . ,  gn  :  A„  as  simply  g  :  F. 
Thus,  in  the  terms  below,  Ag.t  is  to  be  considered  an  abbreviation  for  Agi...  Agn.t.  We  also  abbreviate 
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((((f(gi))g3)-  - )gn)  as  f(g),  and  we  may  write  repeated  applications  (f(z))y  as  f(z,y)  to  avoid 
proliferation  of  parentheses. 

The  simplest  cases  are  when  there  really  is  only  one  formula  on  the  right  of  the  turnstile.  The  com¬ 
putational  content  corresponds  ezactly  with  the  standard  interpretation.  The  rules  in  this  case  are  the 
following: 

Ag.pair(f i(g),f3(g))  :T\-AsB 

f:T\-A  f:ThB 

Ag.inl(f(g)) :  ri-i4V  J3  Ag.inr(f(g))  :  ThA  VB 

t,-.r,BhC  t:T,A\-B 

Ag.Az.fa(g,z(fi(g))) :  T^A^BhC  Ag.(Az.f(g,z)) :  Th  A-B 

There  are  also  cases  in  which  the  rule  is  not  dependent  on  the  number  of  formulas  in  A. 

f  :r,A»-A 

Ag.Az.spr«ad(z;a,v.f(g,u)) :  F,  A  A  B  h  A 
f  :r,BI-A 

Ag.Az.8pread(z;u,v.f(g,v)) :  F,  AABHA 

fi:F,y4l-A  fz.F.BI-A 

Ag.Az.decide(z;u.fi(g,u);v.f2(g,v)) :  F,A  VBh  A 

Finally,  there  are  the  cases  in  which  the  fact  that  there  are  multiple  formulas  on  the  right  of  some 
sequent  is  important.  The  terms  extracted  from  these  cases  are  more  complicated  because  the  conclusion 
has  a  disjunction  t3rpe. 

fi:T\-A,A  f2:FI-B,A 

Ag.d«cide(fi(g);u.d«cida(f2(g);w.inl(pair(u,w));z.inr(z));  v.inr(v)) :  FhA  A  B,  A 

f  :FH^A 

Ag.decide(f  (g);  a.inl(inl(u));  v.inr(v)) :  F  h  A  V  B,  A 
f  :FI-B,A 

Ag.decide(f(g);u.inl(inr(u));v.inr(v)) :  Fh  A  V  B,  A 

fi:FI-A,A  f2:F,BI-A 

Ag.Az.d«cid«(fi(g);u.f2(g,x(u));v.v) :  F,A-*BI- A 

The  case  of  arrow  introduction  on  the  right  where  there  is  more  than  one  formula  on  the  right  is  the  only 
one  where  assuming  the  hypothesis  is  constructive  does  not  imply  the  conclusion  is  constructive.  It  is  here 
we  must  introduce  call/cc. 

f  :F,AI-B,A 

Ag.call/cc(Ah.inl(Ax.decide((f(g))x;u.u;v.any(h(inT(v)))))) :  Fh  A-»B,  A 

To  see  the  intuition  behind  this  term,  we  consider  how  it  will  be  used  in  computation.  Suppose  F  is  empty, 
so  that  we  have  call/cc(Ah.inl(Az.d«cid6(f(z);u.u;v.any(h(inr(v))))))  as  a  closed  term  of  type 
(A— ^B)  V  F.  If  we  use  this  term  in  a  computation  of  a  value,  we  must  perform  a  case  split,  since  it  is 
in  a  disjunction  type.  When  the  term  is  evaluated,  h  is  bound  to  the  current  continuation  and  we  have 
ini (Az. decide (f(z)  ;u.u;v.any(h(inr<v)))))  as  the  term  of  type  (A-+B)  V  F.  So,  we  take  the  branch 
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for  A-*B  with  the  function  Ax. decide (f(z)  :u.u:T.any(h(inr(v))))  as  our  purported  inhabitant  of  the 
type  A-*B.  Now  suppose  this  function  is  applied  to  a  term  a  of  type  A.  Applying  the  function  f  from  the 
hypothesis  of  the  sequent  to  a,  we  have  an  element  of  B  V  A.  Should  this  actually  be  of  type  B,  then  we 
return  that  value.  If  not,  we  have  an  inhabitant  of  A.  We  then  return  the  computation  to  the  stage  at  whidi 
we  took  the  case  split  for  {A-*B)  V  A  by  applying  the  continuation  h  to  our  new  evidence  for  [A-*B)  V  A, 
namely  the  inhabitant  of  A  produced  f  (a).  Thus,  the  continuation  operator  allows  the  computation  to 
switch  paths  when  it  comes  upon  further  evidence  for  which  of  the  disjuncts  is  proved.  This  description  will 
be  made  more  concrete  in  the  next  section  when  the  computation  rules  are  defined. 

The  structural  rules  also  have  term  extractions,  though  their  computational  content  is  uninteresting. 

fiTKA  f:ri-A 

Ag.inr(f  (g)) :  T  h  A,  A  Ag.Ax.f  (g) :  T,  A 1-  A 

f:ri-A,A,A  f:r,A,AI-A 

Ag.decide(f(g);u.inl(u);v.v) ;  FhA,  A  Ag.Ax.f  (g,x,x) ;  r,AI- A 

The  exchanvo  rules  also  require  treatment,  though  their  computational  content  is  trivial. 

f  I  ri”  Aj, . . . ,  At,  At.j.1, . . . ,  Afi 

Ag.docide„(f  (g);  m.ini (ui); . . . ; Ui.iUi+i (u<);ui4.i.ini(ui+i); . . . ; u„.in„(u„)) :  T h  Ai, . . . ,  A^+j,  Aj, . . . ,  A„ 

f  c  Ai,..., At, At-^-i , . . . , Afi K  A 

Axi . . .  Axt4-i  Axt . . .  AXft.f  (xi  t . . .  j  ^+i>  •  •  •  >  ^n)  •  Ai , . . . ,  At+i,  A,,  • .  ■ ,  An  H  A 

4  Reduction  of  terms 

In  this  section  we  describe  a  more  general  term  language  and  present  machine  rewrite  rules  for  computation 
in  this  language.  These  rewrite  rules  will  define  the  operational  semantics  of  the  language,  and  it  is  with 
respect  to  this  semantics  that  we  will  prove  that  reduction  preserves  type. 

We  wish  to  be  able  to  use  these  classical  proof  terms  in  a  generd  context;  for  example,  we  may  want 
to  use  a  classical  proof  term  to  represent  the  propositional  logic  content  of  a  proof  about  the  integers.  To 
do  this,  we  allow  typed  nonlogical  constants  in  the  term  language.  We  cannot  give  reduction  rules  for 
these  constants,  since  they  may  represent  information  about  some  external  theory.  We  instead  require  that 
reductions  involving  these  constants  are  terminating  and  type  correct,  so  that  constants  obey  the  standard 
constructive  mathematical  semantics.  For  example,  if  we  have  a  function  constant  f  of  type  A-»B,  if  we 
apply  f  to  a  of  type  A,  we  are  guaranteed  that  the  result  b  is  of  type  B.  Furthermore  we  must  be  able  to 
treat  b  as  we  would  any  ordinary  member  of  type  B,  so  that  if  B  =  Bo  V  Bi,  we  must  be  able  to  reduce  the 
term  decide (b;u.ti  ;v.t2).  So  we  actually  require  that  all  constants  which  may  appear  during  reduction 
(not  just  the  constants  present  at  the  beginning)  have  the  property  that  reductions  involving  them  are 
terminating  and  type  correct. 

Given  these  assumptions  on  the  behavior  of  the  constants  under  reduction,  we  define  the  program  term 
language  to  be  type-correct  terms  built  from  constants  and  proof  terms  by  any  of  the  term  constructors 
described  above,  except  for  call/cc.  Thus,  we  know  that  all  instances  of  call/cc  in  the  program  are  from 
proof  terms,  and  so  we  know  the  con'  ■.  in  which  they  occur. 

Since  we  may  be  using  these  proof  terms  to  represent  the  logical  content  of  a  proof  in  a  larger  theory,  we 
shall  call  the  result  an  applied  proof  term.  Formally,  we  define  an  applied  proof  term  as  follows; 

•  Proof  terms  are  applied  proof  terms.  The  type  of  a  proof  term  is  the  type  corresponding  to  the  formula 
which  was  proved. 

•  Typed  constants  are  applied  proof  terms. 

•  Typed  variables  are  applied  proof  terms. 
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•  If  ti  and  ta  are  applied  proof  terms  of  type  P\  and  Pi  respectively,  then  pairCti ,  tj)  is  an  applied 
proof  term  of  ^rpe  A  A  and  InKti)  and  isrCta)  are  ^plied  proof  terms  of  type  Pi^  Pt. 

•  If  X  is  a  variable  of  type  Pi  and  t  is  an  applied  proof  term  of  type  then  Az.t  is  an  applied  proof 
term  of  type 

•  If  t  is  an  applied  proof  term  of  type  X,  then  any^(t)  is  an  applied  proof  term  of  type  P.  (The 
superscript  will  be  omitted  when  it  is  dedudble  from  context.) 

•  If  t  is  an  applied  proof  term  of  type  Pi  V  Pi,  and  tiis  an  applied  proof  term  of  type  P  with  free  variable 
a  of  type  I\,  and  tjis  an  ^plied  proof  term  of  type  P  with  free  variable  v  of  type  Pi,  then  decideCt ; 
u.ti ;  v.t])  is  an  applied  proof  term  of  type  P. 

•  If  t  is  an  applied  proof  term  of  type  Pi  A  Pi,  and  tiis  an  applied  proof  term  of  tjrpe  P  with  free 
variables  u  of  type  Pi  end  ▼  of  type  Pi,  then  spread(t;u,v.ti)  is  an  applied  proof  term  of  type  P. 

•  If  f  is  an  applied  proof  term  of  type  P\-*Pi,  and  t  is  an  applied  proof  term  of  type  Pi,  then  applyCf  :t) 
is  an  applied  proof  term  of  type  Pi.  This  term  will  often  be  abbreviated  f  (t). 

An  applied  proof  term  is  called  a  progmm  term  if  it  has  no  free  variables. 

FbUowing  the  example  of  [3],  we  present  the  operational  semantics  as  a  set  of  rules  representing  the 
transition  function  of  a  term  rewriting  machine.  We  use  their  notion  of  a  continuation  point  to  represent  the 
continuation  object;  however,  since  we  do  not  fix  an  evaluation  order,  we  shall  use  a  more  general  notion  of 
evaluation  context  (the  analogous  “applicative  contexts”  defined  in  [3]  are  defined  for  a  particular  evaluation 
order  ).  An  evaluation  context  is  a  term  with  a  hole  in  it,  such  that  the  hole  is  not  within  the  scope  of  any 
binding  operators  (including  those  in  the  body  of  spread  and  decide  terms).  A  program  not  in  normal  form 
can  be  split  into  an  evaluation  context  and  a  redex.  We  use  the  notation  £[A]  for  such  a  program,  where 
.EQ  is  the  evaluation  and  R  is  the  redex.  A  continuation  point  is  an  evaluation  context  tagged  with  p.  If 
.E[A]  is  a  program  of  type  <p  and  R  is  of  type  a,  then  (p,  f^Q)  is  a  continuation  point  which  may  be  applied 
to  any  term  of  type  a  to  produce  a  program  of  type  (p. 

So,  the  reduction  rules  for  terms  are: 


£((Ax.t)a] 
.E[8praad(pair  (a,  b);  u,  v.t)] 
.E[d«cid«(inl(a);  u.ti ;  v.t2 )] 
.E[decide(inr(b);  u.ti ;  v.t2)] 
.E[call/cc(Ak.t)] 
E[{{p,Ek^M 


E[t[x  :=  all 
£[t[u  :=  a,v  :=  b]] 
JS?[ti[u  :=  a]] 
f?[t2[v  :=  b]] 

f?(t(k:=(p,f;D)l] 


(plus  the  assumptions  described  above  about  reductions  involving  constants.) 

The  rule  for  call/cc  described  above,  though  not  fully  general,  is  sufficient  since  all  of  the  terms  extracted 
from  proofs  will  have  the  property  that  the  call/cc  operator  will  only  be  applied  to  terms  of  the  form  Ak  .t. 

Note  that  from  these  rules  alone  we  can  conclude  that  reduction  of  type  correct  programs  preserves  the 
type  of  the  whole  program. 

Theorem  1  If  t  is  a  well-typed  program  term  of  type  <p,  and  t  reduces  to  t',  then  t'  has  type  ip. 

Proof:  The  only  cases  to  verify  are  the  rules  for  reductions  of  call/cc  terms,  and  the  application  of 
continuation  points.  For  the  case  of  cedl/cc,  if  we  have  E[call/cc(Ak.t)]  is  of  type  ip  and  call/cc(Ak.t) 
has  type  a,  we  have  that  Ak.t  has  type  -'a-*a,  since  the  program  is  well-typed.  Internally,  then,  the 
continuation  point  (p,.^fl)  which  is  substituted  for  k  will  have  type  o-»X,  but  in  fact  JS{]  represents  a 
program  of  type  ip  with  a  hole  in  it  for  a  term  of  type  a.  (If  we  had  A-translated  the  whole  program,  this 
difference  between  the  apparent  type  of  the  continuation  within  the  program  and  the  actual  type  of  the 
continuation  in  computation  would  not  appear.)  So,  the  term  t  [k  :=  (p,  .BQ)]  has  type  a,  so  the  reduction 
of  call/cc(Ak.t}  preserves  the  type  of  the  whole  term. 
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fbr  the  case  of  the  api^cation  of  continuatioa  points,  we  showed  above  that  if  call/cc(Xk.t>  has  ^rpe 
a,  then  £Q  represents  a  program  of  type  >p  with  a  hole  in  it  for  a  term  of  type  a.  Since  its  internal  type  is 
a-*l,  though,  we  know  that  if  it  is  applied,  it  must  be  applied  to  a  term  a  of  type  a,  since  the  program 
is  type  correct.  This  reduction  results  in  £(a],  which  is  a  pn^am  of  type  ip  because  ^call/cc(t)]  was  of 
type^.  □ 


5  Proof-theoretic  interpretation  of  reduction 

5.1  Expanded  sequent  calculus 

Since  terms  in  the  term  language  correspond  more  closely  to  natural  deduction  proofs  than  to  sequent  proofs, 
in  order  to  reason  about  properties  of  terms  as  properties  of  proofs  we  shall  extend  the  notion  of  proof  with 
additional  rules.  The  result  is  a  hybrid  between  natural  deduction  and  sequent  systems,  with  the  natural 
deduction  steps  appearing  as  intermediate  steps  in  the  sequent  proof.  Reductions  will  generally  correspond 
to  some  preliminary  rearranging  of  the  proof,  followed  by  a  sequence  of  cut  eliminations.  The  procedure  of 
substitution  corresponds  to  a  sequence  of  cut  elimination  steps. 

We  add  to  the  normal  collection  of  sequent  rules  (including  cut)  the  rules  corresponding  to  elimination 
of  connectives  in  natural  deduction.  We  shall  then  have  that  every  redex  corresponds  to  a  natural  deduction 
rule  or  a  cut,  although  not  every  cut  will  correspond  to  a  redex. 

A  decide  redex  corresponds  to  a  rule  of  the  form: 

hc'.AVB  uiAHfirP  v:Bhlj;P 

l-d«cid«(c;n.fi;v.f2) :  P 

This  may  also  appear  as 

i-c:A,B  u-.Ak-tiiP  y.Bhii'.P 
Kd«cide(c;u.fi;v.f2) :  P 

because  of  the  implicit  disjunction  on  the  right  of  the  turnstile.  ^ 

Similarly,  a  spread  which  may  be  reduced  corresponds  to  a  rule  of  the  form; 

hciAAfl  vl:  A,v :  B\-f  :  P 

h8pr«ad(c;u,v.f) :  P 

Finally,  the  -^-elimination  rule  corresponds  to  application: 

ha  :  A  Hf  :  A—*B 
Hl(a);B 

If  f  (a)  is  actually  a  redex,  this  will  reduce  to  an  ordinary  cut  rule. 


5.2  The  map  from  terms  to  proofr 

Since  we  wish  to  reason  about  properties  of  terms  by  reasoning  about  the  proofs  from  which  they  are 
derived,  we  need  to  associate  a  proof  tree  with  each  program  term.  Furthermore,  it  is  helpful  if  the  proof 
tree  structure  reflects  the  term  structure,  so  we  need  to  augment  the  sequent  proof  with  some  redundant 
information  corresponding  to  the  syntax  of  the  term  associated  with  each  sequent  rule. 

First,  we  describe  the  modifications  needed  to  make  the  original  sequent  proof  correspond  to  the  term 
structure.  The  abbreviation  f(g},  used  above  for  repeated  applications,  requires  additional  explanation  here 
if  each  application  redex  is  to  correspond  to  a  single  cut.  In  what  follows  we  shall  use  a  notation 


g:ri-g:r  f:ri-A 
g:ri-f(g) :  A 


the  reet  of  thia  paper,  all  of  the  implicit  diajunctiona  occatiing  on  the  right  aide  of  the  turnatile  are  now  conaidered  to 
be  explicit.  Thia  allowa  ua  to  treat  the  proof  aa  a  paendo-conatructive  proof  with  only  one  concluaion  of  each  aequent. 
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for  repeated  applkatkm,  whnre  g :  F  is  a  liet  zi :  Ai, . . .  ,Zn  :  An,  aad  tL.e  above  cut  ie  aa  abbreviatioa  for 
the  aequeace  cuta 

zi :  Ai,...,z«  :  AnI-zi :  Ai  f  :  Ai,...,AnHA 
Zl  :  Alt .  •  •  i^n  •  ■^n»^(3tl)  •  Aj, . . .  I  An  A 


Zi  :  All  •  •  •  I Zn  I  An  ^  Zn  !  An  3ti  :  Alt  •  •  •  i  Zn  ‘  Ani^(zi| . . .  ,Zn— l)  I  An  ^  A 

Zi  :  Ai,...,Zn:AnHf(zi,...,z„):A 

where  after  the  first  rule,  the  term  f  (zi)  has  tjrpe  Aa-*(A3-»(. . .  -►(An-»A))),  etc.  This  simply  describes 
the  repeated  application  of  f  to  each  of  the  new  variables  zi . . .  Zn,  where  each  cut  corresponds  to  one 
application. 

We  give  a  few  ezamples  of  the  remaining  cases.  In  general,  the  structure  of  the  term  associated  with  the 
conclusion  of  the  rule  governs  the  structure  of  the  proof  into  which  the  rule  ezpands.  The  effect  of  these 
transformations  is  mainly  to  make  function  applications  and  other  operations  which  are  implicit  in  the  term 
explicit  in  the  proof. 

For  ezample,  the  sequent 

f  :r,AhA 

Ag.Az.8pr«ad(z;u,v.f(g,u)) :  F,  A  A  B  H  A 


becomes 


z:AABhz:AAB 


g  :  F,u :  A,v :  Bhg:  F,u :  A  f;F,AhA 
g:  F,u:  A,v:  Bl-(f(g))u:  A 


Ag.Az.8pr«ad(z;  u,  v.(f  (g))u))  :  F,  A  A  B  I-  A 
As  another  example,  the  sequent 

f  :FI-A,A 


Ag.d«cide(f(g);u.inl(inl(u));v.iiir(v)) :  Fh  A  VB,A 


becomes 


u : Ahu : A 


g:FI-g:F  f:FI“A,A  u:AI-  inl(u) :  A  V  B 


V : Ahv : A 


g:FI-f(g):  A,A 


u :  A  I-  inl(inl(n)) :  A  V  B,  A 


V  :  A  I-  inr(v) :  A  V  B,  A 


Ag.decid«(f(g);u.inl(inl(u));  v.inr(v)) :  FI- A  VB,  A 
There  is  one  case  where  the  expansion  is  not  obvious. 


f  :F,AI-B,A 

Ag.call/cc(Ah.inl(Az.d«cid6((f(g))x;u.u;  v.any(h(inr(v)))))) :  Fh  A— *B,  A 

The  expansion  must  mirror  the  structure  of  the  term,  but  it  is  not  clear  how  the  call/cc  and  the  associated 
variable  h  should  be  treated.  Since  the  evaluation  of  a  call/cc  binds  the  variable  to  the  current  continuation, 
we  shall  make  this  binding  explicit  by  adding  an  extra  cut  with  a  placeholder  representing  the  eventual 
continuation.  We  shall  call  this  placeholder  h  as  well,  and  represent  it  as  a  constant.  The  last  step  in  the 
derivation  is  then: 

hh  :  ((A-B)  V  A)-.±  g  :  F,h  :  ((A-B)  V  A)-±h  i^(Ax.decide((f(g))x;u.u;  v.any(h(inr(v))))) 

•  y  AA 

Ag.call/cc(Ah.inl(Ax.decide((f(g))x;u.u;v.any(h(inr(v)))))) :  Fh  A-»B,  A 
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In  the  deimtion  of  the  body  of  the  call/cc,  the  varinble  h  remains  an  assumption  along  the  branch  of 
the  proof  corresponding  to  the  second  branch  in  the  decide: 


T : AHt : A 

h :  ((A-*B)  V  A)-*±  H  h :  ((A-*B)  V  A)-»i.  ▼  :  A  h  inr(T) :  (A-*B)  V  A 

h  :  ((A-*B)  V  A)-»±,»:  Ai-h(inr(T)) :  ± 
h :  ((A-*B)  V  A)-»X, V :  AH any(h(inr(T))) :  B 

The  remainder  of  the  derivation  of  the  body  of  the  call/cc  is: 
g:r.x  :  AHg:r.x:A  f-  F.AHB.A 

g  :  r,x  :  Ah(f(g))x  :  B,A  n:BI-u;B  (seethe) 

g  :  r,  X  :  A,  h  :  ((A-»B)  V  A)-*±  h  decide((f  (g))x;  u-u;  v.any(h(inr(v)))) :  B 
g  :  r,h  :  ((A-*B)  V  A)— »±I-Ax.decide((f(g))x;u.n;v.any(h(lnr(v)))) :  A-*B 
g  :  r,h :  ((A-+B)  V  A)-»±Hlnl(Ax.decide((f(g))x;u.u;v.any(h(inr(v))))) :  A-*B,  A 

We  have  described  how  to  augment  the  sequent  proof  with  the  information  needed  to  model  computation 
with  it.  We  must  now  associate  sequent  trees  with  the  rest  of  the  term  in  a  similar  manner.  As  previously 
described,  the  program  term  is  constructed  from  terms  extracted  from  sequent  proofs,  variables,  and  typed 
constants,  using  the  usual  term  constructors.  The  sequent  tree  corresponding  to  the  program  term  is  con¬ 
structed  using  the  type  information  and  the  structure  of  the  part  of  the  program  term  not  described  by 
the  sequent  proof.  In  essence,  we  construct  a  proof  of  the  formula  corresponding  to  the  type  of  the  whole 
program,  from  assumptions  corresponding  to  the  types  of  the  constants. 

To  describe  computation  with  constants,  we  associate  with  each  constant  a  sequent  tree  of  the  following 
form,  according  to  its  type.  For  a  constant  c  of  type  P,  the  sequent  tree  is  defined  inductively  as  follows. 
We  shaU  define  a  map  [•]  from  sequents  to  sequent  trees,  and  the  tree  associated  with  c:  P  will  be  the  tree 
(hP). 


[ri-pi 

[ri-AAB| 

[TI-A-BI 

irh-A] 


FHP  if  Pis  atomic 

[THA)  [THB] 
ri-AAB 
[r,AHB] 

ThA-^B 

r,AHi 

ri--.A 


Finally,  we  define 


(ri-AVB] 


[Tl-A]  fri-B] 

— - —  or  — - — 

ri-AvB  FhAVB 


though  we  cannot  know  which  until  the  a  cut  with  F  I-  A  V  B  is  actually  eliminated.  For  example,  this  case 
arises  when  we  have  a  constant  c  of  type  A— »(B  V  C).  We  cannot  know  the  type  of  the  result  until  c  has 
been  applied;  since  we  assume  that  the  evaluation  of  constant  functions  terminates  and  is  constructively 
type  correct,  we  will  be  able  to  decide  what  type  the  result  of  a  particular  application  of  c  actually  has. 

The  above  definitions  correspond  to  a  tableau  proof  development  except  that  formulas  on  the  left  of  the 
turnstile  are  not  broken  down.  This  is  because  formulas  on  the  left  are  arguments  to  the  constant  function, 
and  the  function  must  be  applied  to  arguments  of  the  appropriate  type. 

We  now  have  sequent  trees  associated  with  constant  terms  and  with  the  term  extracted  from  the  original 
sequent  proof.  We  now  define  a  map  from  terms  built  from  these  terms  to  sequent  trees.  The  map  will  be 
denoted  {•} £>,  where  B  is  a  list  of  the  typed  bound  variables  at  any  point  in  the  term. 
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{c :  P}d 

{palr(x,y) :  A ^B}D 
{iiil(z) :  i4  V  J3}x) 
{iiir(z) :  i4  VB}d 
{Az.t :  A-*B}d 
{d«eld«(d;u.ti;T.ta) :  P}d 
{■pr«ad(p;tt,v.t) :  P}d 
{apply(f;z) :  B}d 
{any(z) :  A}d 


the  zugmented  sequent  proof  described  above 
[hi*] 

{z :  A)d  {y  :  B}d 

Bhpair(z,y) :  A/kB 

D  K  inl(z) :  ri  V  B 
{x :  B}d 

D h inr(z) :  A^ B 

{t  :  B}x:A,i> 

Bh  Az.t :  A-*B 

{d :  A  V  B}i>  {ti  :  P}ii:a,d  {^a  :  P}v:B.jd 
i>l-decide(d;n.ti;v.ta)  :  P 

{p:  A  A  B}d  {t  :  P}u:A.V:B.i> 
Z)l-spread(p;u,T.t)  :  P 

{z :  A)d  {f  :  A-*B}d 
i)l-apply(f;z) :  B 

{x:l}z) 

D  I-  any(z) :  A 


5.3  Reduction  as  proof  transformation 

In  this  section,  we  describe  the  correspondence  between  reduction  steps  in  the  program  term  and  cut  elimi¬ 
nation  steps  in  the  proof. 

Most  reduction  steps  in  the  program  will  correspond  to  an  initial  step  which  creates  a  cut,  followed  by 
a  sequence  of  cut  elimination  steps.  This  is  because  substitution  of  a  term  for  a  variable  involves  passing 
through  the  proof  tree  until  the  place  where  the  variable  is  introduced  (as  part  of  an  axiom  sequent)  is 
found,  and  substituting  a  proof  for  that  axiom.  Note  that  this  corresponds  more  closely  with  the  actual 
complexity  of  substitution,  since  the  proof  tree  corresponds  closely  to  the  syntactical  structure  of  the  term. 
Most  reduction  steps  follow  a  pattern  of  setting  up  a  simple  cut  and  eliminating  it,  in  a  sequence  of  cut 
elimination  steps. 

In  the  remainder  of  this  section,  I  will  describe,  for  each  kind  of  reduction,  how  reduction  transforms  the 
proof. 

Note  that  the  first  premise  sequent  of  a  redex  must  be  a  sequent  with  no  hypotheses,  since  we  do  not 
allow  reductions  within  the  scope  of  a  binding  operator. 


5.3.1  Simple  cuts. 

Simple  cuts  arise  from  the  reduction  of  any  of  the  more  complex  rules.  The  elimination  of  a  cut 

ha  :  j4  z  :  Aht :  B 
h(Az.f)a:B 
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corresponds  to  /^-reduction,  so  we  must  describe  how  the  argument  m:A  comes  to  be  substituted  for  z  in 
f .  Since  the  structure  of  the  term  corresponds  so  closely  with  the  structure  of  the  proof  tree,  in  order  for 
the  substitution  of  the  argument  for  the  bound  variable  to  take  place,  the  argument  must  be  propagated 
towards  the  leaves  of  the  tree  to  the  place  where  the  variable  is  introduced  in  an  axiom  sequent.  Observe 
that  this  is  almost  immediate  in  the  terms  which  come  directly  from  the  sequent  proof,  because  these  terms 
are  constructed  via  application  from  the  term(s)  representing  the  hypothesis.  For  general  terms,  however, 
this  is  not  true,  and  it  may  take  several  cut  elimination  steps  for  the  argument  to  reach  the  places  where 
the  variable  was  introduced.  So  we  must  pass  the  cut  with  a:^  towards  the  leaves  of  the  sequent  tree  until 
it  reaches  the  point  where  the  variable  z  was  introduced  as  an  axiom. 

The  procedure  is  described  by  induction  on  the  structure  of  the  proof  ofx  :  A\-f  :  B.  To  eliminate  a  cut 
with  an  axiom  sequent,  as 

Ha  :  >1  X  :  j4,g  :  FHx  :  A 
TKA 

(r  may  of  course  be  empty),  replace  this  cut  with 

FHa :  A. 

To  eliminate  a  cut  with  a  sequent  not  an  axiom,  suppose  that  Pq  H  Aq  and  Fi  H  Ai  are  the  hypotheses 
of  the  sequent  rule  resulting  in  x  :  ^4,  g  :  F  H  d  ;  A.  First,  replace  the  conclusion  of  the  cut  rule  with  with 
g  :  FHd  :  A.  If  x:  >1  is  not  in  either  list  Fo  or  Fi,  then  we  are  done.  Otherwise,  continue  the  process 
recursively  with  the  cuts 

Ha:yl  FqHAo  Ha:i4  FiHAi 

-  and  - 

Ffl  — x:AHAo  Fi  — xiAHAi 

(assuming  both  are  applicable,  i.e.  x  :  ^4  is  in  both  Fo  and  Fi  ). 

In  other  words,  we  replace  the  cut 

FflHAo  FjHAi 

h&:  A  x:Ai,g:FHd:A 

FHA 

Ha:.4  FoHAo  Ha:i4  FiHAi 

Fq  — x:i4HAo  Fi— x:i4HAi 

g:FHd: A 

and  continue  the  cut  elimination  process  with  these  new  cuts  until  an  axiom  is  reached. 

Note  that  although  we  seem  to  be  introducing  new  sequents  Fo  -  x  :  AH  Aq  and  Fi  -  x  :  AHAi,  in 
fact  these  are  eliminated  at  the  next  stage  in  the  process.  In  general,  the  elimination  of  a  cut  removes  its 
conclusion  sequent  and  replaces  it  with  another  sequent  proving  the  same  formula. 

5.3.2  Reduction  of  decide 

There  are  two  cases,  which  reduce  in  essentially  the  same  way. 

1.  decide(inl(a) ;u.ti ;v.t2) 

The  redex  decideCinlCa)  ;u.ti  ;v.t2)  corresponds  to  a  proof  segment  of  the  form 

H  a :  A 

Hinl(a):AvB  u:AHti:P  v:BHt2:P 

Hdecide{inl(a);u.ti;  v.t2) :  P 

This  first  reduces  to 

H  a  :  A  u  :  AHtx  :  P 
HP 

and  then  the  process  described  above  for  simple  cuts  is  performed. 


51 


2.  d«cid«(iiir(b);a.ti;T.t2) 

The  redex  decld«(inr(b)  :tt.ti  :v.t]}  reduces  similarly  to  the  previous  case. 

hh.B 

H inr(b) :  i4  V iS  u;i4l-ti;P  v.Bhti'.P 

t-decld«(iiir(b);tt.ti;v.t3) :  P 


This  first  reduces  to 


hb:B  V  :  Bi-t2  :  P 


bP 

Then,  this  cut  is  eliminated  according  to  the  rules  for  simple  cuts. 


5.3.3  Reduction  of  spread 

The  redex  spreridCpairfa.b)  ;u,v.t)  corresponds  to  the  proof  fragment 

h  a  :  hb  :  R 

l-pair(a,b) :  j4  Afl  u:A,v:BI-t:P 

l-8pread(pair(a,b);u,v.t)  :  P 

After  reduction,  the  proof  becomes 

l-arA  u:A,v:BHt:P 

hb:  B  TTbHP 

hP 

Then,  these  simple  cuts  are  eliminated.  Note  that  a  sequent  (J3 1-  P)  has  been  created,  but  that  it  disappears 
after  the  cut  of  B  is  eliminated,  so  that  after  the  reduction  we  have  a  proof  of 

ht[u  :=  a,v  :=  b] :  P. 

5.3.4  Reduction  of  applications 
An  application  has  the  following  form: 

f-  a  :  A  h  Ax.f  :  A—*B 

H  (Ax.f  )a :  B 

Since  we  have  Ax.f  as  the  inhabitant  of  type  A-*B,  the  previous  rule  must  have  been  an  arrow  intro¬ 
duction  on  the  right.  Hence  we  must  have 

x: Ahf : B 
ha:  A  hAx.f  :  A-B 

h  (Ax.f  )a  :  B 

This  reduces  to 

ha:A  x:Ahf:B 

h  (Ax.f  )a :  B 

without  any  change  in  the  term  itself,  said  then  this  cut  is  eliminated  according  to  the  rules  for  simple  cuts 
above. 
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5.3.5  Coutinuatioii  proob 

To  model  the  capture  and  application  of  continuations  in  the  language,  we  shall  define  the  notion  of  a 
continuation  proof.  Informally,  this  will  be  a  partial  proof  which  requires  another  proof  to  be  complete;  in 
essence,  it  will  be  a  function  from  proofs  to  proob.  For  any  given  proof  of  a  formula  we  will  have  that 
any  continuation  proof  arising  in  the  reduction  of  that  proof  will,  when  the  continuation  proof  is  applied 
according  to  the  rules  described  here,  result  in  a  different  proof  of  <p. 

Under  the  ordinary  intuitionistic  interpretation  of  the  logical  connectives,  a  term  representing  a  proof  of 
an  implication  A-*B  can  be  considered  a  function  from  proofs  of  A  to  proofe  of  B.  A  continuation  proof 
term  is  also  a  function  from  proofs  to  proofs;  however,  it  behaves  differently  when  applied  within  the  context 
of  another  proof.  Suppose  we  have  a  proof  of  a  formula  ip.  A  continuation  (sub)proof  within  that  proof  will 
appear  to  be  a  proof  of  some  formula  A-»±,  and  the  corresponding  continuation  proof  term  will  appear  to 
have  the  type  A— »±.  Should  that  term  actually  be  applied,  however  (in  order  to  produce  an  inhabitant  of  ±), 
what  results  is  not  a  term  of  type  ±  within  a  term  of  type  p,  but  rather  a  new  term  of  type  ip,  corresponding 
to  a  new  proof  of  (p.  (Under  A-translation,  the  ±  in  the  proof  would  have  been  translated  to  a  ^  already.) 

The  notation  we  shall  use  for  the  continuation  proof  terms  is  intended  to  hint  at  their  role  as  functions 
from  proob  to  proofs.  However,  we  do  not  wish  to  consider  the  internal  structure  of  the  continuation  proof 
corresponding  to  the  term.  Accordingly,  we  denote  a  continuation  proof  term  of  type  A-»J.  as 

Ax.O 

This  represents  a  proof  of  (p  with  a  “hole”  of  type  A  in  it,  so  that  if  Ax.Q  is  applied  to  an  argument  of  type 
A,  the  result  is  a  new  proof  of  <p,  i.e.  a  new  program  of  type  p.  Describing  a  continuation  proof  itself  is 
trickier.  Associated  with  every  continuation  proof  term  Ax.[}  is  a  proof  which  determines  the  result  when  the 
continuation  term  is  applied.  To  describe  this  incomplete  proof,  we  use  an  ordinary  sequent  proof,  except 
that  at  one  leaf,  instead  of  an  identity  axiom,  we  have  the  sequent 

hx :  A 

as  an  axiom.  When  the  continuation  proof  is  applied,  this  sequent  is  replaced  with  a  real  proof  of  h  A,  and 
the  term  associated  with  this  proof  is  substituted  for  x.  (In  a  language  using  C  and  A  this  would  be  an 
ordinary  application  followed  by  an  abort;  since  we  are  using  call/cc,  however,  the  application  and  the 
abort  are  performed  in  one  step.) 

Continuation  proofe  arise  from  the  reduction  of  a  term  c2Ql/cc(t).  In  this  system,  the  only  way  such  a 
term  arises  is  as  the  result  of  an  — »  introduction  rule  on  the  right,  when  there  was  more  than  one  formula 
on  the  right.  Thus  we  know  that  the  form  of  the  proof  leading  to  the  term  call/cc(t)  is  as  described  in 
the  previous  section.  The  redex  itself  appears  as  the  sequent 

h  h  :  ((A-*fl)  V  A)— h  :  ((A-»H)  V  A)-*±  h  inl(Az.decide(f  (x);  u.u;  v.any(h(inr(v))))) :  A-*B,  A 

I-  call/cc( Ah.inl(Ax.d6cide(f  (x);  u.u;  v.any(h(inr(v)))))) :  A-*B,  A  a 

_ 

where  a  represents  the  proof  of  the  other  hypotheses  (if  any)  of  the  rule  at  this  point,  0  represents  the  rest 
of  the  proof  below  this  rule,  and  the  proof  above  the  call/cc  is  as  previously  described. 

Reduction  of  this  term  creates  the  continuation  proof 

l-d:A-*HVA  a 

where  d  is  a  new  variable.  The  continuation  proof  term  associated  with  this  proof  is  just  Ad.Q,  and  it 
has  type  ((A-+5)  V  A)— »±.  This  continuation  proof  term  is  then  passed  as  an  argument  to  the  term 
(Ah.inKAx.decideCf  (x)  ;u.u;v.any(h(inr(v)))))),  where  it  is  substituted  for  h.  When  the  dust  clears 
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and  the  reduction  of  the  original  call/cc  is  complete,  the  proof  structure  is 

H  (Ad.O)  :  ((A-B)  V  A)->±  v  :  A  H  inr(T) :  ( A-B)  V  A 

x:AI-x:j4  v :  AH  (Ad.0)(inr(v)) :  ± 

X :  Al-f(x) :  B,A  u:jBI-u:fl  v :  A h any((Ad.O)(inr(T))) ;  B 

X  :  Al-decide(f(x);u.u;v.any((Ad.[])(inr(v)))) :  B 

l-Ax.decide(f(x);u.u;v.aay((Ad.D)(inr(T)))) :  A-*B 

I-  inl(Ax.d«clde(l(x);u.u;T.any((Ad.Q)(inr(T))))) :  A-*B,  A  a 

_ 

We  now  describe  how  reduction  behaves  with  continuation  proof  terms.  Since  continuation  terms  arise 
only  in  the  context  shown  above,  if  the  term  is  in  a  redex,  it  must  be  that  we  have  taken  the  right  branch 
of  the  decide  with  f  (a)  of  type  A  substitute<’  for  v.  So,  the  proof  structure  must  be 

1-  (Ad-O)  :  ((A-»fl)  V  A)-X  h  inr(f  (a)) :  (A-B)  V  A 

H  (Ad.Q)(inr(l(a))) :  X 

Since  this  is  a  continuation  term,  the  reduction  of  (Ad.  [] )  (inr(f  (a)))  transforms  the  whole  proof  to 

h  inr(f  (a)) :  A— ►B  V  A  a 
- 

where  the  context  of  this  sequent  is  the  context  described  above,  so  that  the  proof  reverts  to  the  stage  at 
which  the  call/cc  term  was  reduced,  with  the  additional  information  from  the  proof  of  Hf(a) :  A. 

6  Conclusion 

We  have  presented  a  method  for  extracting  programs  from  classical  sequent  proofs,  which  uses  the  control 
operator  call/cc  to  represent  the  classical  axiom  {-<P-*P)-*P.  As  a  result,  we  have  a  simple  proof  that 
reduction  of  such  terms  is  type  preserving,  and  a  proof  theoretic  framework  in  which  to  treat  problems  of 
reduction.  It  is  hoped  that  this  will  allow  the  use  of  tools  from  proof  theory  to  help  solve  questions  about 
reduction,  such  as  normalization  properties. 
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Abstract 

Given  a  polynomial  /  6  C^z],  it  defines  a  vector  field  Nf{z)  -■ 
-f{z)/f\z)  on  C  Certain  degenerate  curves  of  fiow  in  Nf  giw  the 
edges  of  the  Newtoman  graph,  as  defined  by  |6].  These  give  a  relation 
between  the  roots  of  /  and  mudi  similar  to  the  linear  order,  when 
/  has  real  roots  only. 

We  give  an  algorithm  to  compute  the  Newtonian  graph  and  the 
basins  of  attraction  in  the  Newtonian  field.  The  resulting  structure 
can  be  used  to  query  whether  two  points  in  C  are  within  the  same 
basin  of  attraction  in  Nf.  This  gives  us  an  interesting  approach  to 
use  Newton’s  method  to  find  all  roots  of  /,  guaranteeing  that  we  con¬ 
verge  to  a  root.  This  method  extends  to  rational  functions  and  more 
generally  to  any  fimctions  on  C  whose  flow  satisfies  certain  algebraic: 
conditions. 


1  Introduction 

We  follow  the  definitions  of  Smale  [6]  and  define  the  Newtonian  vector  jield 
of  a  polynomial  /  €  C[z]  by  Nf{z)  =  The  name  is  derived  from  the 

fact  that  Xk+i  Xk  +  Nf{xk)  is  Newton’s  method. 
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The  vector  field  Nf  defines  a  flow  on  C  where  the  flow  comes  (almost 
everywhere)  from  infinity  and  converges  (almost  everywhere)  to  a  root  of 
/.  There  are  degenerate  curves  of  flow  connecting  roots  of  /  and  /',  and 
the  basins  of  flow  split  C  into  finitely  many  regions.  These  connecting  curves 
will  be  the  edges  of  our  Newtonian  graph  (to  be  defined  more  formally  later). 
This  graph  has  been  studied  and  the  types  of  graphs  that  arise  have  been 
classified  [7]. 

We  will  give  a  symbolic  algorithm  to  compute  the  graph,  given  a  polyno¬ 
mial.  Furthermore  our  algorithm  will  find  the  basin  boundaries.  The  output 
of  the  algorithm  is  a  structure  which  can  act  as  an  oracle  to  answer  simple 
questions  such  as 

1.  Given  a,b  €C,  axe  a  and  b  in  the  same  basin? 

2.  Given  a,  6  6  C,  are  a  and  6  on  the  same  curve  of  flow? 

3.  Given  a  G  C  is  a  on  a  basin  boundary? 

4.  Given  a  €  C  is  a  on  a  graph  edge? 

So  not  only  do  we  get  the  topology  of  the  graph,  we  get  a  method  for  mem¬ 
bership  testing  for  the  interesting  regions  in  the  field.  Such  a  structure  can 
for  instance  be  used  in  a  “guaranteed”  Newton’s  method,  modifying  the  step 
size  at  every  point  to  ensure  that  we  stay  within  a  basin. 

We  then  sketch  how  to  extend  the  definition  of  a  Newtonian  graph  for 
rational  functions.  We  also  observe  that  the  resulting  fields  on  C  satisfy 
certain  algebraic  conditions.  Given  such  conditions  we  can  define  the  graph 
and  compute  it. 

2  The  Newtonian  Graph 

We  have  defined  the  Newtonian  field  of  a  polynomial.  A  vector  field  such  ;us 
Nf  on  C  defines  a  flow  on  C.  Given  z  e  C  the  flow  through  z  is  a  function 
:  /  — >  C,  where  /  C  K  is  an  interval  containing  zero,  0,  differentiable  wit  h 

^  =  N,(m) 

0JO)  =  z. 
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That  is,  <t>  parameterizes  the  flow  starting  at  z  and  at  every  point  the  speed 
and  direction  agrees  with  the  field.  An  example  of  flow  of  /  of  degree  4  is 
given  in  figure  1. 


Figure  1:  Flow  in  the  Newtonian  field  of  a  degree  4  polynomial.  Every'  curve 
of  flow  is  directed  to  a  root,  except  the  basin  boundaries  (dotted  lines).  There 
is  a  root  of  /'  on  every  basin  boundary,  and  a  curve  of  flow  from  there  1o 
“adjacent”  roots  (also  dotted  lines). 

The  flow  exists  on  all  of  C\t'/»  (where  V/.  =  {2  e  C  :  f'{z)  =  0}).  The 
existence  and  uniqueness  follows  from  the  theory  of  differential  equations  and 
the  fact  that  Nf  is  a  function  on  C\V/»  (see  e.g.  [3],  §8.2  and  §8.5). 

The  following  lemma  [7]  gives  us  a  very  important  property  of  the  flow: 

Lemma  1  Let  f  €  C[z],  and  <p,  be  flow  through  z  in  the  Newtonian  field 
Nf.  Then  f  maps  the  curve  {<t>i{t)  :  t  £  1}  to  a  straight  line  pointing  to 
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the  origin.  More  specifically, 

/(Mt))  =  e-^f{z). 

Proof.  Computing  using  the  chmn  rule  gives: 

= 

= 

which  is  a  differential  equation  in  t  for  the  function  p{t)  =  Given 

the  initial  condition,  p(0)  =  f(z),  it  has  the  unique  solution  p{t)  =  e~*f(z), 
i.e.  =  e-*f(z).  □ 

Using  the  properties  of  (f>  one  can  show  the  following 

Lemma  2  For  every  z  e  C\(V/  U  V/»),  is  defined  on  a  maximum  inteival 
containing  0,  which  is  of  the  following  type,  for  some  a,  be  R: 

1.  (-oo,+oo),  and  the  flow  comes  in  from  infinity  and  goes  to  a  root  of 

f: 

2.  (-00,  a)  and  the  flow  comes  in  from  infinity  and  goes  to  a  root  of  f ; 

3.  (a,  6)  and  the  flow  comes  in  from  a  root  of  f  and  goes  to  another  loot 
off; 

4.  (a,  +00)  and  the  flow  comes  in  from  a  root  of  /'  and  goes  to  a  root  of 

/. 

Proof.  Nf  is  a  function  IV  — »  C  where  W  =  C\(V/  U  V/*),  and  by 
theorem  in  §8.5  in  [3],  all  flow  must  leave  any  compact  set  of  IV.  By  Lemma 
1,  the  (maximum)  interval  of  0,  is  unbounded  upwards  iff  the  flow  goes  to 
a  root  of  /.  The  same;  argument  shows  that  the  interval  is  not  downward 
bounded  iff  the  flow  comes  in  from  00.  Since  the  flow  leaves  any  compact  set 
of  IV  the  only  other  limit  points  are  in  Vf.  □ 

Definition  3  The  Newtonian  Graph  of  /  €  C[z]  is  the  plane  graph  G  = 
(V,  E)  with  vertices  V  =  V/  U  Vf  and  directed  edges  being  the  curves  of  flow 
between  vertices,  where  these  exist.  □ 
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Figure  2:  The  graph  of  the  field  in  figure  1.  The  solid  dots  are  the  roots  of 
the  polynomial,  the  hollow  ones  are  the  roots  of  the  derivative. 

We  note  that  the  graph  is  not  just  a  combinatorial  structure,  as  the  edges 
come  with  an  embedding  defined  by  (f). 

Figure  2  shows  the  Newtonian  graph  of  the  field  shown  in  figure  1.  An¬ 
other  example  in  figure  3  shows  that  there  can  be  connections  between  two 
roots  of  /'. 

We  observe  that  under  /,  every  edge  maps  onto  a  straight  line  segment 
pointing  to  0  in  C,  with  at  least  one  endpoint  in  {/(c)  :  f[c)  —  0}.  This  is  am 
immediate  consequence  of  Lemma  1  and  the  fact  that  edges  are  curves  of  flow. 
We  can  conversely  look  at  the  pre-images  (under  /)  of  such  line  segments,  and 
we  get  finitely  any  curves  (at  most  n(n  -  1),  since  /  is  an  n  to  1  mapping, 
and  /'  has  at  most  n  -  1  roots).  We  will  use  this  observation  later,  that  the 
graph  is  contained  in  the  pre-image  /“^({m/(c)  :  /'(c)  =  0,0  <  m  <  1}). 
Thus  the  graph  has  finitely  many  edges.  Furthermore  [7]  show  that  the  graph 
is  connected  and  go  on  to  classify  the  possible  types  of  graphs  that  can  arise. 

A  basin  of  attraction  is  a  connected  region  where  the  flow  comes  in  from 
infinity  and  goes  to  one  particular  root  of  /.  A  basin  boundary  is  the  bound¬ 
ary  of  two  basins.  It  is  not  hard  to  show  that  there  must  be  a  root  of  /'  on 
every  basin  boundary,  because  it  requires  a  discontinuity  of  Nf  for  the  flow 
to  “split”  into  two  directions,  and  these  are  only  at  the  roots  of  /'.  Also  the 
basin  boundaries  must  be  curves  of  flow  themselves,  so  we  conclude  that  every 
basin  boundary  is  flow  into  a  root  of  /'.  In  particular  this  means  that  basin 
boundaries  are  contained  in  the  pre-image  f~^{{mf{c)  :  f'{c)  =  0,1  <m}). 
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Figure  3:  The  field  and  the  graph  of  a  degree  3  (real)  polynomial  where  the 
two  derivative  roots  are  linked. 

3  Computing  Basins  and  Graph  Edges 

We  will  give  an  algorithm  to  compute  the  basin  boundaries  and  the  edges 
of  the  Newtonian  graph.  But  first  we  need  a  few  preliminaries  on  cylindric 
algebraic  decomposition. 


3.1  Cell  Decomposition 

We  describe  cylindric  algebraic  cell  decomposition  briefiy.  For  more  detailed 
description,  see  [2]  or  [1]. 

Definition  4  A  decomposition  of  is  a  finite  partition  {CJjg/  such  that 
each  Ci  is  connected,  n  C,  =  0  if  i  ^  j  and  Uie/  =  R*.  For  fc  =  1  such  a 
decomposition  is  cylindric  if  the  each  Ci  is  either  a  point  or  an  interval.  For 
fc  >  1,  the  decomposition  is  cylindric  if  for  all  r,  1  <  r  <  fc,  :  i  € 

/}  is  a  cylindric  decomposition  of  R’’.  □ 

Definition  5  Given  polynomial  equations,  fi{xi,...,Xm)  =  0,  t  =  l,...,n, 
with  fi  €  R[xi,  ...,Xto],  a  Cylindric  Algebraic  Decomposition  (CAD)  of  R”*  is 
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a  data  stnictuie  X>  with  the  following  properties. 

•  V  contains  a  graph,  where  the  nodes  correspond  to  subsets  (cells)  of 
R”,  each  cell  being  homeomorphic  to  for  some  d,  and  the  cells  are 
a  decomposition  of  R”*. 

•  For  all  t  =  1, n,  sign(/i)  is  constant  on  every  cell.  Each  cell  is  labelled 
with  the  signs  that  the  fi  take  on  that  cell. 

•  Every  node  contains  an  oracle  such  that  given  any  c  €  R”*,  the  oracle 
can  answer  if  c  is  in  the  subset  corresponding  to  the  node. 

•  Every  node  contains  dimension  information,  corresponding  to  the  di¬ 
mension  of  the  cell. 

•  The  edges  of  the  graph  correspond  to  adjacency  of  the  cells  in  R”,  t.e. 
there  is  an  edge  (u,  v)  if  the  subsets  that  u  and  v  represent  are  adjacent. 

•  The  decomposition  is  cylindric. 

□ 


Algorithms  have  been  developed  to  compute  (parts  of)  such  a  cell  decom¬ 
position  dating  back  to  Tarski  in  1948  [8].  Collins  [2]  has  a  double  exponential 
algorithm,  although  it  lacks  some  of  the  adjacency  information.  Ben-Or  et 
al.  [1]  developed  a  parallel  algorithm  giving  the  same  kind  of  decomposition 
(the  BKR  algorithm),  and  Kozen  and  Yap  [4]  extended  that  algorithm  to  ob¬ 
tain  the  adjacency  information  as  well  (here  after  named  the  extended  BKR 
algorithm). 

We  note  that  due  to  the  cylindric  condition  and  adjacency  information, 
an  algorithm  computing  such  a  decomposition  can  be  used  on  a  set  of  poly¬ 
nomials  with  quantifiers,  projecting  down  the  result.  If  the  input  is  a  system 
of  polynomials  of  the  form 

3j/l,  ••■J/it  /l(^l»  •••»  ^tn»  yi»  J/fc)  —  0 

/n(®l>  •••»  ®»ni  l/l?  J/fc)  —  0 

then  using  CAD  on  R”*^*  we  can  project  the  solution  down  to  R”*,  by  treating 
the  partitions  according  to  yu  ...,yk  as  insignificant.  The  resulting  structure 
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can  be  used  for  answering  questions  of  the  form:  Given  c  €  R"*,  does  there 
enst  yi,  ...y*  €  R*  such  that  yi,  ...,y*,c  is  a  solution  to  the  system? 

We  note  that  the  order  of  variables  is  important  with  respect  to  the 
cylindric  condition. 

3.2  The  Algorithm 

Recall  that  every  basin  boundary  and  every  edge  is  mapped  by  /  onto  a 
straight  line.  Also,  the  basin  boundaries  and  edges  have  a  root  of  /'  as  a 
limit.  Thus,  all  these  “interesting”  curves  of  flow  satisfy,  for  every  z  on  the 
curve, 

3c€C,m€R  :  f{z)  =  mf{c)  ,  . 

nc)  =  0 

Any  point  z  on  a  basin  boundary  or  an  edge  must  satisfy  these  two  conditions. 
We  note  that  the  converse  is  not  true;  z  €  C  can  be  a  solution  to  (1)  without 
being  on  an  edge  or  a  basin  boundary. 

We  proceed  in  two  steps.  First  we  find  a  decomposition  of  C  describing 
where  we  have  solutions  z  to  (1).  Then  we  prune  that  output,  because  we 
may  get  solution  curves  which  do  not  correspond  to  basin  boundaries  or 
edges. 

To  find  the  solutions  to  (1),  we  can  feed  the  equations 


/(z)  =  m/(c) 

m  =  0 


(2) 


into  our  favorite  cylindric  algebraic  decomposition  algorithm.  The  resulting 
structure  would  be  a  decomposition  of  R  x  C  x  C  describing  regions  where 
such  m,c,  z  exist,  along  with  the  dimension  of  each  region  and  adjacency 
information.  Projecting  m  and  c,  we  get  curves  in  C  for  which  there  exists  a 
solution  to  (1). 

First  let  us  note  that  algorithms  such  as  Collins’  and  the  extended  BKR  do 
decomposition  over  the  reals.  But  we  can  split  the  equations  into  a  real  and 
imaginary  parts,  and  get  a  decomposition  of  R®  «  R  x  C  x  C,  corresponding 
to  the  equations 


fR{x,y)  =  m/fl{ci,C2) 
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fi{x,y)  =  m//(ci,C2) 

/k(ci.C2)  =  0 

fiicuc^)  =  0 

where  f{x  +  iy)  —  fii{x,  y)  +  i//(x,  y)  with  fn,  fj  €  R[x,  y].  We  get  a  decom¬ 
position  on  which  corresponds  to  a  decomposition  on  R  x  C  x  €. 

We  then  project  the  dimensions  corresponding  to  c  =  ci  -I-  ic2  and  again 
project  m,  obtaining  a  decomposition  of  C  corresponding  to  z  for  which  there 
exist  m  and  c  satisfying  equation  (2). 

This  decomposition  will  contain  the  basin  boundaries  and  graph  edges. 
These  may  be  partitioned  into  segments  (bounded  1-cells)  and  0-cells  between 
such  segments.  There  may  be  other  1-cells  present  which  are  not  solutions 
to  the  system  (introduced  by  the  CAD  algorithm  to  get  a  finer  partition). 
However,  we  can  always  identify  the  segments  which  are  a  solution  to  this 
system,  because  all  of  them  are  labelled  with  the  signs  of  the  input  polyno¬ 
mials.  The  curves  which  are  actual  solutions  to  the  system  (2)  will  all  show 
f{z)  =  m/(c).  Hence  a  solution  curve  to  the  system  can  be  reconstructed 
by  Unking  such  adjacent  cells. 

But  not  all  solution  curves  are  edges  or  basin  boundaries.  The  following 
lemma  classifies  the  types: 

Lemma  6  The  output  from  the  process  above  contains  at  most  0{n^)  1 -cells 
which  are  solutions  curves  for  the  input  system.  They  are  of  the  following 
types: 

1.  Adjacent  to  two  0-cells,  one  of  which  describes  a  root  of  f  and  one 
which  describes  a  root  of  either  f  or  f; 

2.  Adjacent  only  to  one  0-cell  which  describes  a  root  of  /'; 

3.  Adjacent  only  to  one  0-cell  which  describes  a  root  of  f. 

Cells  of  type  1  are  edges  of  the  Newtonian  graph,  a  cell  of  type  2  is  a  basin 
boundary  and  cells  of  type  3  are  extraneous  solutions  to  the  system. 

Proof.  The  only  1-dimensional  ceUs  that  can  be  solutions  to  the  system 
correspond  to  curves  of  flow.  Then  the  classification  is  obvious  from  the 
definition  of  edges  and  the  properties  of  basin  boundaries. 
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There  are  at  most  O(n^)  solution  curves  for  f{z)  =  Tnf{c)  with  c  a  root 
of  f{c)  and  m  €  R,  because  there  are  at  most  n  -  1  roots  of  /'  and  /  is  an 
n  to  1  mapping.  □ 

The  celb  of  type  1  and  2  are  the  ones  we  are  interested  in  and  we  must 
tell  these  apart  from  the  extraneous  cells  of  type  3. 

Since  /  is  a  part  of  the  input,  the  signs  of  /  are  given  on  every  cell.  In 
particular  this  allows  us  to  verify  if  a  curve  ends  at  a  root  of  /. 

Depending  on  which  algorithm  we  use,  we  may  or  may  not  have  all  the 
information  needed.  The  extended  BKR  guarantees  that  if  /  is  a  part  of  the 
input,  then  the  signs  of  f  will  be  provided  on  each  cell.  If  we  don’t  have  this 
guarantee,  we  can  always  add  f'{z)  =  0  to  our  input  equations  and  get  the 
same  information  that  way. 

At  this  point  we  can  determine  the  types  of  the  solution  curves.  Now 
it  is  easy  to  implement  the  promised  “pruning”  step.  We  simply  eliminate 
all  cells  of  type  3.  More  precisely,  we  mark  them  as  parts  of  the  adjacent 
2-dimensional  cells  (which  are  the  basin  that  this  cell  lies  in). 

Now  the  structure  can  be  used  in  answering  queries.  Two  points  are  in 
the  same  basin  if  they  are  in  the  same  2-cell  or  if  they  are  separated  only  by 
“fake”  1-cells  (of  type  3). 


4  Improvements 

Recall  we  did  cylindric  decomposition  on  K®  R  x  of  the  equations 

f(z)  =  m/(c) 

/'(c)  =  0 

and  projected  the  solution  onto  C  This  can  be  simplified  by  defining 

g{m,z)  =  Resc{fiz)-mf{c),f'{c)), 

where  ReSc  denotes  the  univariate  resultant  of  two  inputs,  considered  as  poly¬ 
nomials  in  c.  (Here  we  view  f{z)  —  mf{c)  and  /'(c)  as  univariate  polynomials 
in  C[z,m]\c]). 

Then  g  has  the  property  that  p(m,  z)  =  0  iff  3c  e  C  :  J{z)  ~  mf{c)  = 
0  =  /'(c).  Hence,  a  decomposition  of  R  x  C  with  respect  to  g  is  the  same  as 
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the  projection  of  the  decomposition  of  R  x  C  x  C  with  respect  to  the  original 
two  equations. 

The  only  thing  we  must  be  aware  of  is  how  to  obtain  the  necessary  signs 
of  /  and  f  on  cells,  in  order  to  identify  and  link  up  solution  curves  and  prune 
off  the  redundant  ones.  One  way  would  be  to  add  the  equation  f{z)  =  0  (and 
f'{z)  =  0,  if  we  are  not  using  the  extended  BKR),  and  do  a  decomposition 
with  respect  to  /  (/')  and  g.  This  is  already  an  improvement  in  terms  of 
dimensions,  since  we  are  only  working  with  3  real  variables  (x  =  Re(z),y  = 
Im(z)  and  m)  instead  of  5  before. 

The  asymptotic  complexity  remains  the  same,  but  the  constants  are 
clearly  much  better.  The  extended  BKR  gives  an  NC  circuit  of  depth 
n  where  d  is  the  number  of  variables  and  n  is  the  maximum 
of  either  the  number  of  polynomials  or  their  degrees.  In  our  case  the  circuit 
will  be  of  depth  0(log^^^^  n)  where  n  is  the  degree  of  the  input  polynomial 
/. 

5  Applications 

The  Newtonian  graph  is  of  its  own  interest,  as  it  describes  the  arrangement 
of  the  roots  of  /  and  Our  computation  gives  a  complete  topological 
information  of  both  the  graph  and  the  basins  of  the  Newtonian  field. 

The  relation  to  Newton’s  method  gives  an  interesting  method  of  approx¬ 
imating  all  the  roots  of  /  simultaneously,  guaranteeing  convergence.  If  we 
start  with  a  point  zq  in  a  basin,  we  can  apply  modified  Nev^on’s  method, 
where  the  iteration  zj+i  *-  Zk  +  Nf{zk)  is  replaced  by: 

a  ^  1, 
repeat 

Zk+i  ^  Zk  +  aNf{zk) 
a  <—  a/2 

until  (zk^i  is  in  the  same  basin  as  Zk) 

I.e.  we  scale  down  the  step  in  order  to  ensure  that  we  stay  within  the  same 
basin.  Here  we  use  our  pre-computed  structure  of  basins  to  determine  if  tv  o 
points  are  in  the  same  basin.  If  we  furthermore  require  that  a  progress  is 
made  at  each  step,  {i.e.  that  |/(z*+i)|  <  \f{zk)\,  then  we  are  guaranteed  that 
the  sequence  {z*}  will  eventually  converge  to  the  root  in  the  basin  of  zq. 
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We  remark  that  this  does  not  guarantee  quadratic  convergence  every¬ 
where,  it  only  ensures  that  the  method  will  converge. 

6  More  General  Newtonian  Graphs 

In  this  extended  abstract,  we  will  briefly  describe  how  the  Newtonian  graph 
and  its  computation  extend  to  more  general  vector  fields  on  C 

The  definition  of  a  Newtonian  graph  (Definition  3)  only  uses  the  fact  the 
the  function  /  :  IV  — ►  C  is  on  an  open  subset  W  C  C,  which  makes 
^f(^)  =  ^  vector  field  on  W.  Lemma  1  still  holds,  but 

Lemma  6  now  allows  curves  parameterized  (— oo,  -f-oo)  coming  in  from  eithtir 
infinity  or  a  pole  of  /  and  going  to  either  infinity  or  a  root  of  /. 


Figure  4:  Flow  in  the  Newtonian  field  of  a  rational  function  with  three  roots 
and  four  poles.  Curves  of  fixed  color  indicate  curves  of  flow. 
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In  particular,  most  of  the  same  observations  hold  for  rational  functions. 
The  function  /  maps  the  curves  of  flow  onto  straight  line  segments  pointing 
to  the  origin.  The  natural  deflnition  of  a  Newtonian  graph  for  the  rational 
function  is  the  graph  whose  edges  are  the  directed  curves  from  poles  or  inflnity 
to  roots  of  /':  between  the  roots  of  /'  and  frorr  '^ots  of  /'  to  roots  of  /  or 
to  infinity.  This  differs  only  slightly  from  Sme  efinition  [6]  in  that  for 
polynomials  we  now  count  the  basin  boundaries  as  a  part  of  the  graph.  A 
nice  property  of  the  graph  is  that  it  is  symmetric  in  poles  and  roots,  t.e.  the 
graphs  of  A/p/,  and  A/,/p  are  identical,  except  the  directions  of  the  edges  are 
reversed. 

Write  f{z)  =  p{z)lq{z)  with  p^q  €  C{z]  wli,n  gcd{p,q)  =  1.  If  c  is  a  root 
of  /'(c)  =  0  then  /  maps  any  e  ’  ^e  into  c  onto  a  ray  {m/(c)  :  0  <  m  <  1}. 
Again  we  can  consider  a  system  of  equations 

3c€C,m6R  :  f(z)  =  mf{c) 

m  =  0 

which  now  is  equivalent  to  the  system  of  polynomials 

3c  €  C,  m  €  R  :  p{z)q{c)  =  mq(z)p(c) 
p'(c)q(c)  -  p(c)q'(c)  =  0. 

This  we  can  solve  with  algebraic  decomposition  as  before  and  determine  the 
actual  solution  curves  which  are  edges.  As  before  we  can  reduce  the  number 
of  variables  by  using  resultants.  Let 

g(m,z)  =  Il€Sc(p(z)q(c)mq(z)p(c},p'(c)q(c)  -  p(c)q'{c)). 

Then  the  previous  system  is  equivalent  to  3m  g(m,  z)  =  0.  As  before  we 
have  devised  an  NC  algorithm  to  compute  the  Newtonian  graph. 

The  key  property  used  in  the  computation  is  that  the  flow  satisfies  /(0*(t))  = 
In  the  case  of  a  rational  function  f{z)  =  p{z)fq{z)  this  translated 
into  the  polynomial  equation 

-  e'^P{z)q{Mt))  =  0- 

This  equation  also  implicitly  defines  the  flow  6^  satisfying  0,(0)  =  2. 
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In  more  generality,  consider  any  flow  defined  on  IT  C  C,  where  IV 
is  almost  all  of  C  Assume  for  some  polynomial  g  with  complex  coeffi¬ 
cients  and  for  all  2  G  W,  <^,(0)  =  z  and  =  0.  Then  we  have 

Dg{<t>z{t),e\z)  =  0,  i.e. 

Dig{<t>z{t),e\z)4>\{t)  + D2g{4>z(t),e\z)  =  0 
which  is  equivalent  to 

<i>'z{t)  ^  -D2g{4>z{t),e\z)IDig{(t>z{t),e\z).  (3) 

If  a  basin  of  attraction  is  the  area  where  all  the  flow  goes  from  one  pole  to 
one  root  then  as  before  there  is  a  discontinuity  in  the  fleld  somewhere  along 
every  basin  boundary.  In  particular,  4>'  will  be  undeflned  at  such  points. 
Equation  (3)  shows  that  extends  continuously  to  all  of  C  except  the 
points  where  Dig{<p^{t),e\  z)  =  0  and  D2g{(l>z{t),e\  z)  #  0.  These  points 
can  be  computed  as  being  the  i  €  C  for  which  there  exist  m,  m'  G  IR  and 
z,x'  e  C  with 


g{x,m,z)  =  0 
g{x',m\z)  =  0 
Dtg{x\m',z)  =  0, 

The  first  two  conditions  force  x  and  x‘  to  be  on  the  same  curve  whereas  the 
third  condition  places  x'  at  a  discontinuity  of  the  field.  It  is  easy  to  verify 
that  for  rational  functions  we  get  the  same  equations  as  before. 

Again  we  can  do  CAD  on  R®  and  project  down  for  the  x  variable  to  obtain 
the  solution  curves,  which  then  are  the  edges  of  the  Newtonian  graph. 
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ABSTRACT .  Because  of  the  general  to  specific  nature  of  the 
backward  ordered  reasoning  (from  goals  to  input  data)  in  some 
expert  systems,  it  is  hard  to  organize  sets  of  rules  that  lead  to 
multiple  goals.  In  classification  type  expert  systems,  in 
particular,  it  is  often  difficult  to  organize  the  rules.  Normally 
it  is  desired  that  under  all  circiunstances  they  ask  for  all  the 
information  required.  They  should  then  conclude  with  a  report 
which  contains  all  the  conclusions  the  system  should  reach  in 
this  situation.  In  this  project,  an  auxiliary  computer  program 
was  written  to  topologically  sort  the  120  rules  in  the  knowledge 
base  of  an  expert  system.  The  conclusions  of  the  rules  were  used 
as  the  means  by  which  to  define  a  partial  order  of  the  logic  flow 
in  the  knowledge  base. 

Key  words  -  Expert  System,  mixed  chaining,  knowledge  base, 
topological  sort. 

INTRODUCTION .  Generic  categories  of  expert  systems 
applications  include  decision  management,  diagnosis 
/troubleshooting  (determining  malfunctions  from  symptoms  and 
pther  observable  facts) ,  classification  and  interpretation  of 
situations  (concluding  situation  descriptions  from  the  data  and 
facts  encountered) ,  planning  and  scheduling  analysis, 
manufacturing  design,  configuring  objects  under  constraints, 
instruction  and  intelligent  dociimentation,  configuration  design, 
and  process  control  (programs  to  govern  the  overall  behavior  of 
systems) . 

In  1989  the  US  Army  Engineer  Waterways  Experiment  Station 
(WES)  established  a  research  and  development  work  unit  within  the 
Civil  Works  Research  and  Development  Program's  Flood  Control 
Channels  Budget  Package  entitled  "Gravel  and  Boulder  Rivers" 
(#32553).  This  effort  has  two  major  goals:  the  first  being  to 
develop  an  understanding  of  the  physical  sedimentary  processes  in 
rivers  and  streeuas,  the  second  being  to  develop  a  conceptual 
model  of  these  processes.  An  initial  streeun  reach  inventory  form 
was  developed  and  validated  during  1989-1991.  Based  on  the  data 
gained  by  a  nationwide  inventory  conducted  by  MCI  Consulting 
Engineers,  INC.  for  WES,  a  lack  of  understanding  of  and  data  for 
boulder /gravel  systems  became  apparent.  Work  was  done  to: 

a.  Establish  a  systemic  procedure  for  collecting  and 
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analyzing  geonorphological,  geometric,  hydraulic,  and  sedimentary 
data  using  a  stream  reach  inventory  process. 

b.  Identify  sediment  sources  and  deposition  zones. 

c.  Identify  channel  bed  and  bank  forms  which  are  hydraulic 
inf luencers. 

d.  Relate  channel  processes  to  channel  features  and  link  the 
sedimentation  patterns  to  river  engineering  factors. 

Efforts  were  conducted  to  develop  technical  guidance  documents 
for  use  by  District  personnel.  The  end  product  was  envisioned  to 
be  the  basis  for  uniform  data  collection  methods  for 
boulder /gravel  river  systems.  As  a  result  of  work  conducted  on 
this  project  in  1992  an  existing  set  of  separate  stream  bed 
channel  flow  rules  was  organized  into  a  classification  type 
computer  expert  system  by  the  author  using  the  methodology 
explained  in  this  report. 


This  paper  describes  a  knowledge  based  expert  system 
entitled  CHANNEL*-FIX.  The  program  is  intended  to  serve  the 
hydraulic  engineer  as  a  Boulder /Gravel  River  sedimentation 
analysis  tool.  CHANNEL-FIX  provides  guidance  in  the  fluvial 
georaorphic  processes  occurring  in  a  Boulder /Gravel  river  reach 
linked  to  5  of  the  6  stream  channel  design  variables. 

In  terms  of  the  description  of  how  the  rules  are 
implemented,  the  scope  of  this  study  is  limited  to  a  particular 
version  of  the  expert  system  software  shell  used  (Levels  ver  1.1 
for  the  Macintosh) .  The  general  procedures  to  be  explained  in 
this  report  are  applicable  to  this  type  of  software  expert  system 
shell  in  general  but  the  specific  syntax  and  grammar  of  the  rules 
in  the  knowledge  base  and  system  specific  functions  will  be 
different  for  other  shells. 


Since  the  stream  bed  flow  expert  system  program  falls 
under  the  type  of  expert  system  used  for  classification  and 
interpretation  of  situations  some  of  the  specific  characteristics 
for  expert  systems  in  these  areas  will  be  briefly  described 
below. 


Classification  expert  systems  help  the  user  to  choose 
products,  procedures,  or  processes  from  a  large  or  complex  set  of 
alternative  possibilities.  These  programs  identify  a  hypothesis 
based  on  the  pattern  of  data  that  the  user  enters  in  response  to 
a  series  of  questions.  Since  the  questions  are  asked  in  response 
to  a  set  of  presupplied  hypotheses  (that  is  they  are  framed  and 
scheduled  from  the  general  to  the  specific)  these  systems  are 
basically  backward-chaining.  However,  as  will  be  explained  below, 
in  some  situations  the  information  that  accumulates  as  the  data 
is  entered  may  influence  the  order  in  which  the  questions  should 
be  asked.  To  take  account  of  this  the  knowledge  base  and 
inferencing  strategy  also  need  to  continue  accumulating 
information  even  after  each  partial  conclusion  is  reached.  This 
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■ay  require  the  expert  system  to  start  back  through  the  rules 
again  or  iterate  repeatedly  by  forward  chaining  through  the  rule 
sets  several  tines. 

The  following  short  glossary  defines  soae  of  basic  terns 
that  will  be  used  in  explaining  the  problens  that  arose  in 
designing  the  knowledge  base  for  the  expert  system  shell: 

Terminology 

A  short  list  of  some  basic  expert  system  terminology  is 
listed  below 

Attribute  —  Defines  the  qualities  or  values  contained  in  a  class 
and  the  type  of  information  that  make  up  a  class.  For  example, 
the  class  car  can  have  the  attributes  "type  of  engine"  and  "top 
speed" . 

Attribute  value  —  An  actual  number  or  confidence  factor 
representing  the  degree  of  certainty  with  which  a  factor  is 
known. 

Backward-Chaining  —  An  inferencing  strategy  that  is  structured 
from  the  general  to  the  specific.  That  is,  it  starts  with  a 
desired  goal  or  objective  and  proceeds  backwards  along  a  series 
of  deductive  reasonings  while  it  attempts  to  collect  the 
hypotheses  required  to  be  able  to  conclude  the  goal.  This  process 
continues  until  the  goal  is  reached  and  it  then  displays  its 
conclusion.  (See  following  sections  for  a  more  complete 
explanation  and  an  example. 

Class  —  Defines  the  structure  (in  terms  of  its  attributes)  and 
behavior (in  terms  of  its  associated  methods  and  procediures)  of  an 
object.  When  it  becomes  an  instance,  it  then  holds  the  actual 
data  values  of  a  particular  realization  of  this  type  of  object  in 
the  knowledge  base.  For  example:  a  class  called  hvunan  beings 
might  have  attributes  related  to  the  parts  that  differentiate  our 
physical  beings  and  categories  such  as  those  related  to  its  our 
mental  and  spiritual  capacities.  Some  of  the  associated  methods 
and  procedures  of  this  class  could  be  thinking,  talking,  walking. 
It  can  be  considered  as  a  subclass  of  another  class  such  as  the 
class  of  living  beings.  The  author  and  the  reader  are  both 
specific  instances  of  a  human  being  object. 

Antecedent  —  The  IF  part  of  a  conditional  statement  (synonymous 
with  the  tera  hypothesis  in  »fhat  follows) . 

Consequent  —  The  THEN  part  of  a  conditional  statement 
(synonymous  with  the  term  conclusion  in  what  follows) . 


See  also,  the  Levels  object  for  Windows  Users  guide.  Clips 
users  manual,  and  a  guide  to  expert  systems  by  Waterman  all  of 
which  are  listed  in  the  bibliography  at  the  end  of  the  report. 
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Control  Rule  —  A  rule  in  the  knowledge  base  that  controls  the 
order  in  which  data  is  assimilated  into  the  knowledge  base. 

Goal  —  A  top-level  consequent  of  the  rules  in  the  knowledge 
base  toward  which  Backward-Chaining  may  be  directed.  (It  is  a 
hypothesis  that  the  program  will  try  to  determine  if  some  group 
of  rules  can  be  instantiated  together  to  satisfy) 

Inference  Mechanism  —  The  component  of  the  expert  system  shell 
responsible  for  using  the  rules  in  the  knowledge  base  to  derive 
new  facts  from  known  information. 

Instance  or  Instantiation  —  Specific  occurrence  of  an  object. 

An  object  consists  of  its  class  structure,  which  defines  its 
attributes  and  behavior  and  its  instances,  which  hold  the  actual 
values  of  the  object.  An  instance  of  the  class  human  beings 
mentioned  above  would  refer  to  an  individual  person,  such  as  the 
reader  of  this  report. 

Knowledge  Tree  -  A  graph  showing  the  logic  and  data  flow 

connections  between  rules  and  facts  in  the  knowledge  base.  A 
knowledge  tree  presents  a  graphical  representation  of  the 
complete  structure  of  the  knowledge  base. 

Method  -  A  procedure  stored  in  an  object's  class  structure 

that  can  determine  an  attribute's  value  when  it  is  needed  in  the 
program  ,  referenced  in  its  class,  or  required  to  execute  a 
series  of  procedures  because  another  value  in  the  program 
changes.  “When  needed  methods"  are  executed  during  backward 
chaining  to  determine  an  attribute's  value.  "When  changed 
methods"  implement  a  procedure  when  a  given  attribute  changes. 

Node  -  A  vertex  or  point  in  the  knowledge  tree  connecting  the 

antecedents  and  consequents  of  rules  in  the  knowledge  base.  In 
most  conventions  the  nodes  are  the  rules  and  the  antecedents  and 
consequents  are  the  edges  between  the  nodes  or  vertices. 

Object  —  General  term  for  a  programming  entity  that  has  a  record 
type  data  structure  along  with  attribute  values  and  procedures 
that  enable  it  to  represent  something  concrete  or  abstract.  It 
can  be  contrasted  with  other  programming  entities  such  as  facts, 
rules,  procedures,  or  methods.  An  object's  structure  is  defined 
by  its  class  and  attribute  definitions.  A  class  declaration  is  a 
data  template  involved  in  representing  knowledge  which  defines 
the  structure  of  an  object.  For  example,  in  the  class  "human 
being"  mentioned  above  some  of  the  attribute  slots  might  be  size, 
weight,  hair  color,  and  so  forth. 

Expert  System  —  A  computer  program  that  represents  and  uses 
expert  human  knowledge  to  attain  high  levels  of  performance  in  a 
problem  area.  An  expert  system  has  two  basic  components:  a 
knowledge  base  which  contains  the  information  (facts,  rules,  and 
methods)  found  in  the  subject  area  of  the  problem  area  being 
represented,  and  an  inference  engine  or  mechanisms  that  make  use 
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of  the  knowledge  base  (by  scheduling  and  interpreting  the  facts, 
rules,  and  methods)  to  make  conclusions  and  decisions  and  solve 
problems  that  would  normally  take  a  human  expert  more  effort. 

Expert  System  Shell  —  The  interactive  programming  environment  on 
the  computer  into  which  the  user  enters  information,  rules,  and 
goals  and  which  compiles  the  knowledge  base,  then  runs  the 
resulting  expert  system  program. 

Forward-Chaining  —  Forward-chaining  reasoning  is  an  inferencing 
strategy  in  which  the  questions  are  structured  from  the  specific 
to  the  general.  That  is,  it  starts  with  user  supplied  or  known 
facts  or  data  and  concludes  new  facts  about  the  situation  based 
on  the  information  found  in  the  knowledge  base.  This  process  will 
continue  until  no  further  conclusions  can  be  reached  from  the 
user  supplied  or  initial  data  (using  the  rules  and  methods 
contained  in  the  knowledge  base) .  (See  following  sections  for  a 
more  complete  explanation  and  an  example) . 

Vertex - Same  as  node.  (See  above) 
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EXAMPLE  OF  HOW  THE  RULE-BASED  SYSTEM  CAN  CLASSIFY  THE  PLANIFORM 
STABILITY  OF  A  REACH  IN  AM  ACTUAL  STREAMBED 


The  WES  CHANNEL-FIX  rule  based  system  contains  about  80 
rules.  101  facts  in  950  lines  of  computer  code  for  a  Macintosh 
personal  computer.  Once  the  program  is  started  on  the  computer,  a 
screen  appears  which  explains  the  system  which  is  driven  by 
graphical  menus  and  buttons.  The  user  enters  information  into  the 
program  by  either  clicking  buttons  on  the  screen  with  a  cursor 
directed  by  a  keyboard  mouse  or  by  typing  text  from  the  keyboard 
in  order  to  answer  the  guest ions  that  appear  on  the  screen. 
Certain  menu  choices  or  guestions  in  the  program  are  preceded  by 
explanatory  pictures  on  the  screen.  These  pictures  give  the  user 
a  graphical  explanation  of  some  of  the  menu  choices  that  are 
displayed.  Also,  when  the  explain  button  appears  above  the 
guestion  area  on  the  screen  window,  the  program  will  display 
explanatory  text  when  this  button  is  clicked.  When  the  system  has 
asked  all  the  questions  that  it  needs  to  determine  which  rules 
and  facts  may  be  applicable  to  the  situation  the  session  will 
occur  a  summary  of  all  conclusions  and  determinations  will  be 
printed  out  on  the  screen  and  saved  in  a  file  in  the  program's 
directory  work  area.  At  any  time  during  the  series  of  questions 
that  the  program  makes  a  partial  conclusion  the  user  may  click  on 
the  explain  button  to  see  displayed  which  rules  and  facts  were 
used  to  make  that  particular  conclusion. 

As  an  example  of  the  steps  involved  in  using  the  program  we 
will  display  the  questions  and  determinations  for  a  session  in 
which  the  user  enters  the  information  for  a  reach  in  the  North 
Fork  Licking  river.  Normally  a  reach  of  the  river  would  be 
determined  from  the  data  from  several  cross-sections  at  the  site. 
After  the  initial  screen  appears  the  next  step  in  the  program  is 
normally  initiated  by  clicking  on  the  continue  button  that 
appears  above  the  question  area  in  the  program's  screen  window. 

a.  SAMPLE  PROGRAM  RUN  CROSS  SECTION  #1  NORTH  FORK  LICKING  RIVER 

(1)  The  program  asks  for  the  name  of  the  river  which  the 
user  enters  as  North  Fork  Licking  in  this  case. 

(2)  The  program  then  asks  for  the  type  of  bar  that  is 
present  in  this  reach  of  the  river  survey.  In  this  case  the  user 
responds:  Point  Bar. 

(3)  The  program  then  asks  for  the  active  channel  width^  in 
feet.  This  is  entered  as  75. 

(4)  The  program  then  asks  for  the  slope  of  the  river  bed  and 
the  water  surface  slope  at  this  point  in  the  stream.  The  answers 
entered  in  this  case  are:  .01  and  .02. 


^  See  the  reference  Harrell [1993]  for  the  definition  of  the 
hydrologic  terms  used  in  this  example. 


75 


(5)  The  program  then  asks  the  user  whether  fines  are  present 
on  the  bar  surface.  The  answer  is  Yea  in  this  exai^ple. 

(6)  The  program  then  asks  whether  large  clasts  are  in  direct 
contact  at  this  point  in  the  reach.  The  answer  is  MO  in  this 
example . 

(7)  The  program  then  asks  whether  imbricatio  Is  present. 

The  answer  give  is  Yes. 

(8)  The  program  then  asks  whether  you  can  identify  the 
evidence  of  fresh  scour  on  the  outside  bank.  The  answer  given  is 

Yes. 

(9)  The  program  then  asks  whether  there  are  fresh  deposits 
on  the  bar.  The  answer  given  is  Yes. 

(10)  The  program  then  asks  what  is  the  average  depth  of  the 
active  channel.  The  answer  given  is  2  ft. 

(11)  The  program  then  asks  whether  there  is  fresh  scour  on 
the  bar.  The  answer  given  is  No. 

(12)  The  program  then  asks  whether  there  are  diffuse  gravel 
sheets.  The  answer  given  is  Yes. 

(13)  The  program  then  asks  whether  the  Main  Channel  is 
increasing,  stable,  or  decreasing.  The  answer  given  is 
increasing. 

The  program  then  concludes  the  session  and  prints  out  a 
screen  displaying  all  the  conclusions  reached.  This  information 
is  shown  below: 

Based  on  your  description  of  this  reach  of  North  Fork  Licking 
River,  the 

following  conclusions  can  be  drawn: 

The  sedimentary  structure  of  the  bar  is  Matrix  Gravel 
Large  clast  are  not  typically  in  direct  contact  in  Matrix 
gravels. 

The  matrix  consists  of  30%  or  more  sediment  finer  than  fine 
gravel.  Fluvial  action  will  rapidly  entrain  the  matrix  sediment 
reducing  the  stability  of  the  gravel  clast. 

This  erosional  process  occurs  at  mean  flow  or  higher.  Field 
data  indicates  the  even  burial  of  clast  to  75%  does  not  increase 
stability. 

Tr?  've  or  shear  stress  produced  by  mean  flow  will  entrain 
the  matr  finer  grain  sediment.  The  lack  of  clast 

interlock!.  ^  chat  is  present  in  framework  gravel  reduces  the 
stability  although  there  is  a  high  per  cent  of  fine  grained 
matrix  material. 

Matrix  gravel  units  appear  to  be  a  grouping  clusters.  This 
lack  of  stability  and  high  erodibility  factor  leads  the 
assignment  of  a  stability  rating  of  4. 

The  relative  stability  of  the  bar  (from  1  to  4)  is:  4.00 

The  Active  Channel  Width  is  probably  increasing 
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The  slope  is: increasing 

The  neander  pattern  is: decreasing 

The  stability  of  the  channel  ls:decreasing 

Conclusion:  The  channel  is  migrating  to  the  outside 

The  bank  is  providing  transported  sediment. 

The  sediment  transport  is: increasing 
The  Main  Channel  Depth  is  stable 
The  stability  of  the  planiform  is:  2.00 

where  -i-l  >■  change/ increase 
0  a  neutral 
-1  »  change/decrease 

The  bank  is  eroding:  True 

The  bar  is  eroding:  False 

The  bar  is  migrating:  False 

The  bar  is  growing:  True 


Report  of  conclusion  for  North  Fork  Licking  River  complete. 
***End  of  session*** 


Note,  that  in  this  example  the  program  had  to  ask  the  question 
whether  the  Main  Channel  was  increasing,  stable,  or  decreasing. 
For  another  set  of  reach  information  it  is  possible  that  the 
rule-based  system  would  have  been  able  to  determine  this  from 
information  already  entered.  In  general,  there  are  not  enough 
rules  to  determine  all  the  conclusions  that  may  be  required  in 
order  to  proceed  completely  with  any  given  set  of  facts.  The 
program  will  then  request  the  user  to  supply  the  answer  to  the 
missing  information.  The  purpose  of  sorting  the  rules  as 
explained  earlier  in  the  report  in  terms  of  the  information 
required  in  the  hypotheses  of  each  rule  is  that  the  program  will 
in  all  cases  be  able  to  proceed  in  a  single  progreun  run  in  a 
manner  which  extracts  all  the  information  required  to  medce  all 
possible  determinations  that  the  rule-base  will  permit. 


If  we  examine  the  stream  bed  flow  program  we  see  that  it  is 
a  rule-based  system  which  collects  or  makes  a  report  of  a  series 
of  conclusions,  not  just  one.  Therefore  it  does  not  fall  in  the 
area  of  backward  goal  searching  diagnostic  programs  in  which  the 
questions  are  structured  from  the  general  to  the  specific.  It  is 
a  forward  chaining  rule-based  system  in  which  the  information  is 
accumulated  by  asking  a  series  of  questions  which  are  structured 
from  the  specific  to  the  general. 

The  order  of  the  goals  in  the  program  was  restructured  in 
order  to  make  it  consider  all  the  rules  in  a  single  progreun  run. 
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An  abbreviated  forward  chaining  flow  chart 'for  stream  bed 
flow  rule  system  as  it  now  exists  is  illustrated  below: 


step  1 

Determine  the  river  name 
Determine  thc^  type  of  bar 
Determine  the  active  channel  width 

step  2 

Determine  bar  composition 
step  3 

Determine  bar  stability 

Determine  if  the  bank  and/or  bar  is  eroding  or  not 
Determine  channel  depth 
Determine  channel  slope 
Determine  if  the  bar  is  migrating  or  not 

step  4 

Determine  if  the  active  channel  width  is  changing 
Determine  if  the  main  channel  depth  is  changing 
Determine  if  the  bar  and/or  the  bank  is  providing 
transported  sediment 

Determine  if  the  bar  is  eroding  faster  than  the  bank 

step  5 

Draw  conclusions  about  the  present  state  of  the  width,  such 

as  a  point  bar  is  forming 
Draw  conclusions  about  the  effect  of  the  bar  on 

the  active  channel  width 

Draw  conclusion  if  deposition  is  occurring  on  the  inside  of 

the  bend 

Draw  conclusions  about  the  affect  of  increasing  channel  width  on 

slope 

Draw  conclusions  about  the  affect  of  diffuse  gravel  sheets  on 

slope 

Draw  conclusions  about  the  affect  of  slope  on  channel  stability 

step  6 

file  a  report  of  all  the  information  entered 
and  conclusions  reached 
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The  reason  for  this  organization  comes  from  both  the  way  the 
software  is  written  and  the  type  of  knowledge  base  that  we  want 
to  create.  The  rest  of  the  report  will  further  elaborate  on  the 
organization  and  explain  how  it  was  arrived  at. 

The  version  of  LEVELS  that  was  used  for  this  study  is 
a  backward  chaining  (goal  driven)  PROLOG^  ^type  expert  system 
shell  based  on  predicate  calculus.  It  provides  a  good  graphical 
user  interface,  built-in  database  search  predic=»tes,  and  some 
object-  oriented  features.  For  classification  problems  which  are 
data-  driven  and  for  which  you  need  to  record  everything  that  can 
be  determined  about  the  situation,  a  forward-  chaining  LISP®  ® 
or  CLIPS^  type  system  with  more  object-  oriented  features  is 
better . 


^  "A  logic  programming  language  based  on  predicate 
calculus”.  Barker,  1988. 

^  The  book  "PROLOG,  Programming  for  Artificial 
Intelligence",  by  Ivan  Bratko,  Addison-Wesley  Publishing  Co, 1986 
contains  a  well-written  and  readable  guide  to  understanding  how 
Prolog  type  expert  system  programs  work.  See  also,  "Logic 
Programming  and  Knowledge  Engineering"  by  Tore  Amble,  Addison 
Wesley  Publishing  Co., 1987. 

®  LISP  -  "A  programming  language  well  suited  for  list 
processing  and  symbolic  manipulation.  It  is  currently  the  most 
popular  AI  language  in  the  United  States",  Barker,  op.cit. 

®  LISP  3rd  ed. ,  by  Patrick  Henry  Winston  and  Berthold  Horn, 
Addison  Wesley  Pxiblishing  Co,  1989. 

^  CLIPS  User's  Guide,  by  Joseph  C.  Giarratono,  NASA  Lyndon 
B.  Johnson  Space  Center,  Information  Systems  Directorate, 
Software  Technology  Branch,  1991. 


A  Brief  Description  of  the  Way  Porwerdi  end  Beokwerd  Chaining 
Works 


In  forward  chaining,  the  inference  mechanism  starts  by 
evaluating  the  first  rule  in  the  knowledge  base.  If  the 
antecedent  of  that  first  rule  is  true,  then  the  consequent  of  the 
rule  is  used  to  search  for  a  conditional  with  an  antecedent 
identical  to  the  previous  consequent.  This  forward  chaining 
continues  until  the  system  is  unable  to  match  a  consequent  with 
an  antecedent.  Because  the  system  reasons  from  the  information  or 
data  provided,  this  form  of  processing  is  said  to  be  data  driven. 
The  two  rules  below  will  ser^'e  as  a  demonstration  of  this 
process : 


IF  A  IS  true,  THEN  B  IS  true 
IF  B  IS  true,  THEN  C  IS  true 

a.  The  following  steps  define  how  forward  chaining  could  be 
applied  to  the  rules  above: 

(1)  If  "A  is  true"  is  known,  the  inference  mechanism  will 
prove  "B  is  true"  by  modens  ponens®. 

(2)  The  system  then  searches  forward  for  a  rule  that  has  an 
antecedent  that  matches  the  consequent  "B  is  true".  A  match  is 
found  in  the  second  rule. 

(3)  Again  the  law  for  modens  ponens  is  used  to  prove  that  "C 
is  true",  since  no  further  rules  can  be  found  with  antecedents 
that  match  consequents,  the  system  will  offer  "C  is  true"  as  its 
conclusion. 

In  backward  chaining,  the  inference  mechanism  starts 
with  a  goal  and  seeks  to  find  a  rule  with  that  goal  as  its 
consequent.  It  then  verifies  whether  or  not  that  rule  can  be 
derived  from  another  rule  by  finding  another  rule  whose 
consequent  matches  its  antecedent.  This  process  of  backward 
chaining  continues  until  a  rule  is  found  that  has  an  independent 
antecedent.  Thus,  backward  chaining  is  actually  goal  driven  in 
its  problem  solving  strategy. 

The  example  below  demonstrates  the  implementation  of  this 
concept: 

Goal  statement 


®  "A  rule  of  inference  that  states:  IF  A  implies  B  and  A  is 
known  to  be  true,  then  B  is  true. 
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D  IS  true 


Production  rules 

RULE  1 

IF  A  IS  true 

THEN  B  IS  true 

RULE  2 

IF  B  IS  true 

THEN  C  IS  true 

RULE  3 

IF  C  IS  true 

THEN  D  IS  true 

b.rhe  first  statement  in  the  example  above,  "D  IS  true,"  is 
the  goal  for  this  knowledge  base.  The  following  steps  explain 
how  LEVELS 's  inference  mechanism  backward  chains  to  prove  this 
goal: 


(1)  The  system  begins  by  searching  for  a  rule  with  the  goal 
"D  IS  true"  as  its  consequent.  Since  Rule  3  satisfies  this 
condition,  the  program  backward  chains  to  check  if  the  antecedent 
"C  IS  true" 

can  be  derived  from  another  rule. 

(2)  It  is  discovered  that  Rule  2  does,  in  fact,  have  a 
consequent  that  matches  the  antecedent  of  Rule  3.  The  program 
will  now  test  to  see  if  the  antecedent  of  Rule  2,  "B  IS  true", 
can  be  derived  from  another  conditional . 

(3)  Rule  1  has  a  consequent  that  matches  the  hypothesis  in 
Rule  2. 

LEVELS  searches  once  more  for  other  supporting  rules.  Since  none 
can  be  found,  the  program  asks  the  user: 

Is  it  true  that: 

A  IS  true 

If  the  user  answers  yes,  the  inference  mechanism  is  able  to 
reach  the  conclusion  that  "D  IS  true"  based  on  the  law  of 
hypothetical  syllogism^. 


®  A  rule  of  inference  that  states:  IF  A,  then  B.  If  B,  then 
C.  Therefore,  If  A,  then  c. 
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Suppose  we  try  and  create  a  fully  backward  chaining 
problem  solving  strategy  to  implement  our  stream  bed  flow  expert 
system  rule  base.  We  need  one  primary  goal  which  all  the  other 
rules  work  backward  toward  solving;  a  series  of  secondary  goals 
each  of  which  has  information  needed  by  the  primary  goal;  and  a 
whole  series  of  secondary  factors  which  contribute  to  the 
information  required  to  satisfy  the  secondary  goals. 

DETERMINE 

primary  goal  PLANIFORM  STABILITY 

y  t  X 

DETERMINE  DETERMINE  DETERMINE 

secondary  goals  MAIN  CHANNEL  CHANNEL  WIDTH  SLOPE 

DEPTH  STABILITY  STABILITY 

STABILITY 


secondary  factors:  channel  depth,  channel  slopes,  channel  width, 
fines  present  on  the  bar  surface,  type  of  bar,  etc.^° 


In  this  simplification  of  our  river  channel  flow  system  the 
flow  chart  for  the  problem  solving  strategy  would  be: 


A'O 


examine  main  channel  depth 
stability 

4 

width  factors  involved 
examine  width  stability 
slope  factors  involved  vo 
examine  channel  slope  stability 


The  secondary  factors  are  connected  to  the  secondary 
goals  by  sets  of  control  rules  which  are  explained  below. 
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In  order  to  implement  this  problem  solving  strategy  in  backward 
chaining  we  would  need  what  is  called  control  rules.  For  example, 

we  would  write 

RULE  1 

IF  Channel  Depth  and  Slope  Stability  known 

AND  Channel  Width  Stability  known 

THEN  Planiform  Stability  known  {conclusion  1} 

RULE  2 

IF  Channel  Depth  Stability  known 
AND  Channel  Slope  Stability  known 

THEN  Channel  Depth  and  Slope  Stability  known  {conclusion  2} 
RULE  3 

IF  Channel  Width  IS  increasing, decreasing, stable 
THEN  Channel  Width  Stability  known  {conclusion  3} 

RULE  4 

IF  Channel  Depth  IS  increasing, decreasing, stable 
THEN  Channel  Depth  Stability  known  {conclusion  4} 

RULE  5 

IF  Channel  Slope  IS  increasing, decreasing, stable 
THEN  Channel  Slope  Stability  known  {conclusion  5} 


The  goal  of  the  above  abbreviated  backward  chaining  system 
is  to  arrive  at  the  conclusion  "Planiform  Stability  is  known". 
Since  Rule  1  has  that  conclusion  as  its  consequent,  the 
inferencing  mechanism  tries  to  satisfy  the  two  antecedents  of 
Rule  1:  "Channel  Depth  and  blope  Stability  known”  and  "Channel 
Width  Stability  known".  The  consequent  of  Rule  3  matches  the 
latter  so  the  inferencing  mechanism  tries  to  satisfy  rhe 
antecedent  of  Rule  3:  "Width  is  increasing,  decreasing, 
stable". We  would  then  have  a  rule  dependency  map  for  the  full 
rule  set  as  below: 


RULE 

RULE 


RULE  2 

RULE  3 


1 


or  a  similar  graph  if  we  plotted  the  relationships  by 
rule  conclusions  instead  of  rule  numbers. 

CONC  4  ^ 

;;;>  CONC  2 

CONC  5  ^CONC  1 

CONC  3 
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The  difficulty  with  this  problem  solving  strategy  is  that  when  we 
add  rules  to  get  the  information  that  is  needed  to  form  the 
conclusions  in  the  higher  level  rules, the  hypotheses  for  those 
rules  may  contain  variables  which  fire  other  rules  that  alter 
previous  conclusions.  For  example,  suppose  we  have  a  rule  that 
says  : 

RULE  6 

If  A  diagonal  bar  is  present 

THEN  Channel  Width  IS  increasing 

then  we  cannot  determine  what  RULE  3  (which  involves  the  Channel 
Slope  in  its  conclusion)  will  say  until  RULE  6  (which  involves 
Channel  Width)  is  evaluated.  If  the  program  happens  not  to 
consider  the  rules  in  the  correct  order  (as  it  doesn't  in  this 
example)  then  the  conclusion  reached  will  not  be  valid.  Thus  we 
see  that  in  the  most  general  situation  conclusions  reached  upon 
considering  beginning  rules  may  have  to  be  reevaluated  in  light 
of  later  conclusions  that  the  rule^based  system  reaches. 


'v, 
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How  to  Croat*  Forward  Chaining  in  a  Backward  Chaining  Syston 

Although  PROLOG  goal  driven  systems  are  not  designed 
primarily  for  data  driven  problems  (see  Brakto  (1986)  and  Amble 
(1987)  for  an  explanation  of  how  these  systems  work)  ,  it  is 
possible  to  simulate  forward  chaining  in  a  backward  chaining 
search  using  global  variables^^  and  recalling  the  goal  (cycling) 
after  each  success.  Newer  versions  (2.5  and  later)  of  the  LEVELS 
software,  written  for  IBM  compatible  personal  computers,  have 
significant  object  oriented  features  which  make  forward  chaining 
in  this  type  of  system  easier.  However,  LEVELS  version  1.1  which 
is  available  for  the  Macintosh  and  used  for  this  report  can  also 
be  used  for  these  type  of  problems. 

In  cycling  the  program  there  must  be  only  one  top-level 
goal  (which  is  called  "for  chain"  in  the  manual^^)  .  A  global 
variable  (called  "step")  then  allows  you  to  consider  different 
groups  of  rules  on  different  passes  through  the  rule  base.  The 
programmer  must  then  organize  the  groups  of  rules  so  that 
every  group  is  consider  in  a  fixed  sequence  of  different  "steps". 

The  hierarchy  of  goal  levels  shown  in  the  manual  where  they 
are  listed  in  outline  form  such  as  1.,  1.1,  2.,  2.1,  only  work 
when  you  need  to  make  a  single  pass  through  the  set  of  rules. 
First  goal  selection  determines  which  upper  level  rule  you  want 
the  compiler  to  unify  variables^^  on.  This  works  just  like  when  a 
PROLOG  compiler  asks  you  which  goal  to  solve  for  in  its  predicate 
rule  base.  It  does  not  prioritize  the  goals  and  search  for  all 
possible  solutions.  But,  the  search  levels  are  created  by  the 
developer  placing  what  is  sometimes  called  a  "salience  factor"^^ 
(or  operator  precedence  factor)  on  the  rule  when  it  is  placed  on 
the  goal  stack^^.  When  a  new  hypothesis  is  placed  on  the  search 


"A  value  established  for  use  when  no  procedure  or  binding 
(a  place  in  memory  reserved  for  a  value  associated  with  a  symbol) 
primitive  supplies  a  value."  Winston  and  Horn,  op.cit. 

12  levels  for  the  Apple  Macintosh,  User's  Guide,  by 
Information  Builder's,  Inc.  1250  Broadway,  New  York,N.Y.  10001. 

12  ''The  process  of  comparing  two  pattern  expressions  to  see 
if  they  can  be  made  identical  by  a  consistent  set  of 
substitutions."  Winston  and  Horn,  op.cit. 

"  A  priority  number  given  to  a  rule.  When  multiple  rules 
are  ready  to  be  satisfied  or  as  is  sometimes  said,  for  "firing", 
they  are  fired  in  order  of  priority.  The  default  salience  is 
zero.  Rules  with  the  same  salience  are  fired  according  to  the 
current  conflict  resolution  strategy."  NASA  op.cit. 

A  list  of  all  the  goals  that  the  inference  mechanism  is 
backward  chaining  in  order  to  satisfy.  The  goal  at  the  top  of  the 
list  is  the  goal  which  the  compiler  is  currently  searching  the 
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stack^^  (also  called  an  agenda)  salience  numbers  are  checked  to 
make  the  insertion.  The  search  then  keeps  going  until  it  bottoms 
out  on  a  lower  level  goal.  Problems  suited  for  this  type  of 
program  organization  are  for  instance,  diagnostic  rule  bases  or 
classification  problems  with  only  one  end  conclusion.  Groups  of 
definitions  of  objects  fall  into  this  category. 

One  way  to  order  the  rule  base  is  to  list  which  rules  have 
variables  in  their  hypotheses,  define  a  partial  order  on  the 
rules  by  letting  the  rules  be  nodes  in  a  graph  and  connect  two 
nodes  (rules)  with  an  edge  if  the  conclusion  of  one  is  used  in 
the  hypothesis  of  another  and  then  topologically  sort  the 
rule  base  according  to  this  partial  order: 

ALGORITHM  TO  TOPOLOGICALLY  SORT  RULES  IN  AN  EXPERT  SYSTEM 


0)  For  the  whole  set  of  nodes  of  conclusions  in  the  rules: 

1)  If  every  conclusion  node  has  a  predecessor,  then  stop.  The 
rule  based  system  has  a  cycle  and  is  infeasible  (that  is,  a 
partial  order  cannot  be  defined  on  it) . 

2)  pick  a  node  V  which  has  no  predecessor 

3)  place  V  on  a  list  of  ordered  nodes 

a)  if  a  terminal  conclusion  node  is  reached,  print  out  the  list 
of  rules  used  on  the  way  to  reach  that  conclusion. 

4)  delete  all  edges  leading  out  from  V  to  other  nodes  in  the 
network 

5)  Go  to  step  0).^® 


knowledge  base  of  consequents  in  order  to  unify  variables  on. 

"A  list  of  all  rules  that  are  presently  ready  to  be 
satisfied.  It  is  sorted  by  salience  values  and  the  conflict 
resolution  strategy.  The  rule  at  the  top  of  the  search  stack  or 
agenda  is  the  next  rule  that  will  fire.”,  NASA,  op.  cit. 

Recall  the  definition  from  mathematics  that  a  partial 
order  is  a  relation  rel(x,y)  between  objects  in  a  set  that 
satisfies  the  reflexive  (  rel(x,x)  is  always  true),  and 
transitive  conditions  (rel(x,y)  and  rel(y,z)  true  implies  that 
rel(x,z)  true).  If  the  relation  also  satisfies  the  symmetric 
condition  (  rel(x,y)  true  implies  rel(y,x)  true)  then  it  is 
called  an  equivalence  relation. 

A  further  discussion  of  the  way  this  algorithm  works  in 
the  case  of  any  partial  order  and  how  to  write  the  pseudo  code 
for  a  simple  version  of  it  is  given  in  the  books:  Fundamentals  of 
Data  Structures  by  E.  Horowitz  and  S.  Sahni,  Computer  Science 
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Example: 


a)  initial  network: 


b)  node  visited  - 1 


remaining  network: 


5 

6 


c)  node  visited  -  4 

2 

remaining  network:  \ 

3 - ►  5 

\ 

6 


Press,  Rockville,  MD,  1982., and  Algorithms  +  Data  Structures  = 
Programs  by  Niklaus  Wirth,  Prentice  Hall,  1976.  AC  source  c^e 
implementation  of  it  along  with  a  further  discussion  is  included 

in  Appendix  II. 
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d|  node  visited  3 
remaining  netwraric 


e|  node  visited  6 
remaining  network:  ^ 

\ 

5 

■t  this  point  a  terminal  conclusion  la  reached  and  the  number  of  levels 
of  rules  needed  (3|  is  printed  out 

f)  node  visited  2 
remaining  network: 

5 

g)  node  visited  5 


An  example  of  how  the  above  algorithm  works  is  illustrated 
in  figures  1  and  2  above. 

First  a  successor  list  for  each  rule  conclusion  node  is  created: 


successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 


vertex 

1 

[  2  3  4] 

vertex 

2 

[  5] 

vertex 

3 

C  5  6] 

vertex 

4 

[  6  5] 

vertex 

5 

[] 

vertex 

6 

[] 

Then  the  algorithm  produces  a  topological  ordering  of  vertices 
as  shown  in  the  figures  and  as  listed  below: 


14  3  6 

terminal  conclusion  reached 
3  levels  of  rules  recpiired 


2  5 

terminal  conclusion  reached 
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3  levels  of  rules  required 


with  the  above  topological  ordering  when  rule  6  is 
considered,  the  information  for  either  rule  3  and  4  is  then 
required.  And  each  of  these  rules  will  require  the  information 
from  rule  1.  This  path  of  rules  then  forms  at  least  two  forward 
chaining  "cycles"  or  "steps"  .  We  do  not  know  beforehand  whether 
rule  3  or  rule  4  will  provide  the  way  to  satisfy  rule  6,  hence  a 
separate  step  is  required  to  recycle  through  the  rules  to  cover 
all  possible  cases  (see  the  next  section  for  the  details  of  how 
this  is  implemented  in  the  code) .  To  better  organize  things  for 
future  additions  to  the  rule  base,  it  is  prudent  to  add  another 
step  for  rulel  and  thus  use  three  steps  for  this  path  of  rules. 
For  step  4  in  this  example  we  consider  rule  5.  This  rule  then 
requires  the  information  f'.  .  lule  2  and  rule  1  in  that  order. 
Rule  1  has  already  been  considered  in  step  1.  With  this  path  all 
rules  have  been  considered.  Therefore  two  more  steps  of  recycling 
through  the  rules  are  required  to  consider  the  whole  rule  base. 
These  two  steps  will  then  insure  that  all  the  information 
necessary  to  reach  any  possible  conclusion  has  been  entered. 

If,  in  entering  the  input  information,  we  change  the  order 
in  which  the  nodes  coming  out  of  a  given  vertex  are  ordered  the 
program  gives  a  different  output.  This  is  a  result  of  the  fact 
that  for  a  given  set  of  order  relations  there  may  be  many 
different  ways  of  defining  a  partial  order  on  them.  Consider  what 
happens  in  the  above  algorithm  if  we  change  the  input  order: 


successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 
successor  list  for 


vertex 

1 

[  4 

3  2] 

vertex 

2 

[  5] 

vertex 

3 

[  6 

5] 

vertex 

4 

C  6 

5] 

vertex 

5 

[] 

vertex 

6 

[] 

the  algorithm  will  then  produce  the  following  topological 
ordering  of  vertices: 


1  2  3  4  5 

terminal  conclusion  reached 
3  levels  of  rules  required 


6 

terminal  conclusion  reached 
3  levels  of  rules  required 


However,  upon  examining  this  output,  it  can  be  seen  that  the 
number  of  pa^s,  "cycles",  or  "steps",  required  to  enter  all  the 
information  into  the  classification  system  is  the  same  in  the  two 
cases.  Also,  the  maximum  depth  or  ntimber  of  levels  of  backward 
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reasoning  for  all  cycles  will  be  the  same  for  both  cases.  The 
next  section  will  explain  the  coding  procedures  for  implementing 
these  forward  chaining  cycles  or  steps  which  are  determined  by 
the  topological  order. 
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Proo«4ur«s  to  Uso  in  Cronting  tha  Knovledga  Basa  in  a 
Classification  Typa  Expart  Systaa 

In  order  to  create  a  knowledge  base  there  is  an  organized 
procedure  that  one  can  follow: 

a. Establish  the  facts  by: 

(1)  collecting  all  the  relevant  facts  and 

information 


(2)  divide  the  objects  in  the  facts  into  different 

categories 

(3)  outline  or  catalog  the  complete  set  of  facts 
according  to  these  categories. 

(4)  write  down  all  the  rules  (involving  forward 
chaining)  relating  these  categories. 

(5)  write  down  a  decision  tree  for  what  the  expert 
system  is  trying  to  analyze  such  as  that  shown  in  the  previous 
paragraph . 


(6)  determine  the  one  goal  which  the  expert  system  is 
trying  to  satisfy. 

(7)  write  down  all  the  backward  chaining  rules  which 
help  to  satisfy  that  one  goal. 

(8)  relate  the  forward  chaining  and  backward  chaining 
rules  in  order  to  have  the  expert  system  perform  its  task  in  one 
program  run. 

i)  write  down  a  complete  table  of  all  the  variables 
and  conclusions  involved  in  the  knowledge  base. 

ii)  topologically  sort  the  rules  based  on  the 
order  the  conclusion  occur  in. 

iii)  plot  all  the  paths  in  the  knowledge  base  in 
which  variables  can  be  instantiated  and  conclusions  reached  (this 
is  done  in  paragraph  20  in  this  report) . 

iv)  use  global  variables  to  group  the 
instantiation  of  the  variables  and  predecessor  conclusions 
involved  in  the  hypotheses. 


For  an  example  that  explains  the  forward  chaining  procedure 
consider  the  following  abbreviated  exeunple  using  the  river 
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(5) 

If  we  add  a  rule  or  topic  for  obtaining  the  initial 
information  we  need  to  start  the  forward  chaining,  and  a  final 
rule  or  topic  to  generate  the  report  and  write  it  to  a  file  we 
can  now  write  down  an  outline  or  decision  tree  for  what  the 
expert  system  is  trying  to  analyze. 


1.  Introduction 

1.1  River's  name  entered 

2 .  Draw  conclusions 

2.1  Bar  composition  determined 

2.2  Channel  depth  determined 

3. 

3.1  Report  filed 


RULE  1 

RULE  2  RULE  3 
RULE  4 

RULE  5 


(6)  In  this  case  the  one  goal  that  the  expert  system  is 
trying  to  satisfy  is  to  generate  the  final  report  (which  by  the 
way  should  contain  all  the  conclusions  the  forward  chaining  has 
generated) 

If  we  call  this  one  goal  "forward  chain" 

We  can  now  write  down  a  flow  diagram  for  the  program 
to  reach  all  the  conclusions  we  want: 

step  1 

river's  name 
step  2 

bar  composition 
step  3 

channel  depth 
step  4 

file  a  report  containing  all  the  conclusions 
reached 

(7)  We  can  write  this  rule  to  finish  the  forward  chaining 

as: 


RULE  5 

IF  previous  steps  complete 
AND  FILE  results  file  footer 
THEN  stop 

The  problem  at  this  point  is  that  there  hasn't  been  a  condition 
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channel  flow  information^®: 

(1) .  We  have  various  attributes  that  describe  the  river 
channel  flow  geometry:  among  these  are  bar  composition 
framework  gravel,  censored  gravel,  filled  gravel,  or  matrix 
gravel,  channel  depth (a  numerical  value).  Once  we  know  these 
attributes  we  have  a  list  of  rules  relating  them  from  which  we 
can  draw  inferences. 

(2)  We  define  a  data  structure: 

ATTRIBUTE  The  bar  composition 

AND  channel  depth 

We  will  also  need  a  string  variable  which  hold^s  the 
river's  name  to  write  on  the  final  report: 

STRING  The  river's  name 


(3)  We  can  now  organize  this  information  in  the  following 

outline 


1.  River's  name  entered 

2 .  Bar  composition  determined 

3 .  Channel  depth  determined 

(4)  we  can  now  write  down  the  forward  chaining  rules 
involving  these  attributes  and  variables: 

RULE  1 

IF  The  river's  name  <>"" 

AND  FILE  the  results 
THEN  River's  name  entered 

RULE  2 

IF  Fines  are  present  on  the  bar  surface 
THEN  The  bar  composition  IS  matrix  gravel 
AND  Bar  composition  determine^ 

RULE  3 

IF  NOT  Fines  are  present  on  the  bar  surface 
AND  Grains  on  the  bar  surface  are  interlocked  with 
voids 

THEN  The  bar  composition  IS  Framework  Gravel 
AND  Bar  composition  determined 

RULE  4 

IF  Channel  Depth  >  0 

THEN  Channel  Depth  determined 


The  steps  in  the  example  are  ordered  according  to  the 
letters  in  the  above  procedures. 
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determined  which  in  all  cases  signals  that  the  forward  chaining 
has  ended  and  tells  RULE  5  when  to  fire. 


(8)  In  a  LEVELS  program  it  is  possible  to  have  series  of 
nested  goals  for  which  we  use  the  goal  select  feature  to  choose 
which  ones  the  system  backward  chains  to  satisfy.  But,  in  the 
above  example  the  LEVELS  compiler  will  choose  either 

1 .  or  2 .  or  3 . 

1 . 1  then  3.1. 

2.1  or  2.2. 

But,  it  will  not  be  able  to  search  for  all  combinations  of 
solutions  to  the  whole  list  on  the  same  program  run.  For  once  a 
bottom  level  goal  is  reached  and  there  are  no  more  rules  to  check 
on  the  rule  search  stack  (agenda)  for  the  satisfaction  of  this 
goal  the  program  stops.  That  is,  suppose  the  user  chooses  1.  and 

1.1  using  goal  select  and  suppose  RULE  1  is  of  the  form: 

IF . 

THEN  Rivers  name  entered.  We  have: 

goal  stack  rule  search  stack 

1 . 1  River '  s  name  entered  RULE  2 

(satisfied)  RULE  3 

RULE  4 

(no  further  matches 
possible) 

Then,  none  of  the  remaining  rules  satisfies  the  goal  and  the 
program  stops. 

The  way  'cycling'  works  is  through  the  use  of  a  single  goal 
along  with  global  variable  values.  In  the  above  example  we  would 
have  only  one  goal: 

goals 

1.  forward  chain 

In  rewriting  RULE  1  we  substitute  'forward  chain'  for  the 
previous  goal  and  add  a  global  variable  'step'. 

RULE  1 

IF  The  river's  name  <>"” 
and  FILE  results  file  header 
and  step:=l 

THEN  River's  name  entered  global  variable 

and  step : =2  step 

and  CYCLE 

RULE  2 

IF  Fines  are  present  on  the  bar  surface 
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and  step:»2 

THEN  The  bar  composition  IS  matrix  gravel 
and  Bar  composition  determined 
and  step:»3 
and  CYCLE 

RULE  3 

IF  NOT  Fines  are  present  on  the  bar  surface 

and  Grains  on  the  bar  surface  are  interlocked  and  no  voids 

and  step; =2 

THEN  The  bar  composition  IS  Fraunework  Gravel 
and  step: =3 

anc  Bar  composition  determined 
and  CYCLE 

RULE  4 

IF  Channel  Depth  >  0 
and  step: *3 

THEN  Channel  depth  determined 
and  forward  chain 
and  step: =4 
and  CYCLE 


RULE  5 

if  step; =4  built-in  function  stop 

and  FILE  the  results 
then  stop 

In  this  situation  after  the  goal  forward  chain  has  been 
reached  for  the  first  time  the  global  variable  step  is  changed 
from  1  to  2.  Then,  there  are  more  rules  left  in  the  search  stack 
(agenda)  which  can  satisfy  the  same  goal,  so  the  program 
continues  to  search  for  solutions: 

goal  stack 

1.  forward  chain 
(satisfied 

global  variable 
step 

built-in  function 
stop 

The  way  the  program  finally  stops  is  by  means  of  using  a  built-in 
function.  The  phrase  'built-in'  means  that  the  program  will 
execute 


search  stack 

RULE  2 
RULE  3 
RULE  4 

(further  matches  possible) 
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its  meaning  as  soon  as  it  is  scanned^®  and  before  the  whole 
sentence  is  parsed^ 

note:  In  this  example  because  of  the  small  number  of  rules  it  is 
not  necessary  to  perform  substeps  i},ii),and  iii  in  part  (8)  of 
the  procedure.  The  next  example  shows  how  using  the  basic 
techniques  in  parts  (1)  through  (7)  along  with  the  substeps  in 
part  (8)  it  is  possible  to  organize  a  large  scale  system. 


We  now  consider  the  group  of  all  the  rules  in  our  river 
channel  flow  expert  system  in  which  the  variables  and  conclusions 
are  to  be  chained  together: 

For  these  purposes  a  variable  is  defined  to  be  a  statement 
which  appears  in  the  hypothesis  section  of  a  rule  and  about  which 
it  is  to  be  determined  whether  it  is  true  or  false  or 
instantiated  to  some  object  or  attribute^^. 

We  now  consider  the  group  of  all  the  rules  in  our  river 
channel  flow  expert  system  in  which  the  variables  and  conclusions 
are  to  be  chained  together: 


The  scanner  is  the  part  of  the  compiler  that  analyses 
characters  of  the  program's  text  for  identification  of  known 
words,  variables,  functions,  and  procedures. 

The  parser  is  that  part  of  a. compiler  that  analyses 
complete  sentences  of  the  knowledge  base  (rules,  facts,  etc.)  and 
determines  their  total  meaning. 

A  variable  is  instantiated  when  there  is  some  object  or 
attribute  which  it  is  equal  to. 
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Table  2 


Variable  List 


Variable 


Fines  are  present  in  the  reach _ 


Fines  are  present  on  the  bar  surface _ 


Large  clasts  are  in  direct  contact  _ 


Vegetation  is  present  on  the  bar  surface 


Bar  stability  is  known  _ 


The  bank  is  eroding  _ 


The  bar  is  growing  _ 


Channel  Depth  is  known 


Slopes  are  known  _ 


The  type  of  bar  has  been  identified 


Active  channel  width  is  known 


Fresh  scour  can  be  seen  on  the  outside  bank 


Fresh  scour  can  be  seen  on  the  bar 


The  reach  is  a  river  bend 


Grains  on  the  bar  surface  are  interlocked 


Fresh  deposits  can  be  seen  on  the  bar 


Fines  are  present  in  the  reach 


The  type  of  bar  present 


The  bar  composition 


The  active  channel  width 


The  bar  is  migrating 


Diffuse  gravel  sheets  exist 


Imbrication  is  present 


bar  deposition  will  occur 
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Table  3 


Conclusion  List 

Number 

Conclusion 

Hypotheses^  ^ 

Cl 

The  bar  composition  IS - 

V2,V3,V15 

C2 

The  type  of  bar  present  IS  - 

C3 

The  bar  is  providing  transported 

CIS 

sediment. 

C4 

The  bar  is  eroding  faster  than 

V13,C15 

the  bank. 

C5 

The  active  channel  width  IS  - 

Cll, CIS, C7 

C6 

The  main  channel  depth  IS  - 

C11,C1S,C7,V23,C1 

,C17,V24 

C7 

The  bar  is  growing 

C14,C2,V16 

C8 

Width  conclusion  IS  - 

V12,C2,V14,C4,C17 

,C1S 

C9 

The  active  channel  width  will  be 

C2,V12 

CIO 

The  bar  may  be  migrating 

V2,C2,V17,C1S,V4 

Cll 

The  bank  is  eroding 

V12 

C12 

The  bank  is  providing 
transported  sediment 

Cll 

C14 

The  bar  is  migrating 

C2,V2,V17,V4 

C15 

The  bar  is  eroding 

C2 

C16 

Deposition  is  occurring  on  the 
inside  of  the  bend 

C2,C11,V14 

C17 

Sediment  transport  is  - 

C4,C12 

C20 

Bar  deposition  will  occur 

V22 

C23 

The  slope  IS  - 

CS 

Variables  which  appear  in  a  rules  hypothesis  are  numbered 
VI, V2,  etc.  according  to  the  number  appearing  in  the  variable 
table,  conclusion  are  numbered  Cl,C2,etc  analogously.  The 
information  for  conclusions  which  do  not  have  variables  or  other 
conclusions  listed  in  this  column  is  entered  interactively. 
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C24 


The  channel  stability  is 


C23 


H 

C26 

Meander  pattern  IS  - 

C23  1 
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Table  4 


1  Rule  List^* 

Number 

Rule  Name 

Conclusions 

R1 

Identify  bar 

C2 

R2 

Forcing  a  few  more  conclusions 

C7,C11 

R3,R4,R5 

,R6 

Composition 

C2 

R7 

Bank  Scour  indicates  erosion 

Cll 

R8 

Scour  indicates  erosion 

CIS 

R9 

Bar  erosion  provides  sediment 

C12 

RIO 

Bank  erosion  provides  sediment 

C12 

Rll 

Bar  eroding  before  bank 

C14 

R12 

Stable  channel  with  respect  to 
width 

C5 

R13 

Bar  erosion  implies  increasing 
width 

C5 

R14 

Wider  channel  implies  less  depth 
and  energy 

C6 

R15 

Bank  erosion  implies  increasing 
width 

C5 

R16 

Growing  bar 

C7 

R17 

Narrowing  channel 

C5,C6 

R18,R19, 

R20,R21 

Eroding  bar  indicates  not 
interlocked 

Relative  shear  force 

Bank  scour  Indicates  migration  to 
outside 

No  width  conclusions  can  be 
reached 

C8 

■ 

This  is  an  abbreviated  list  of  the  total  number  of  rules 
in  the  knowledge  base.  In  order  to  simplify  the  table,  some 
groups  of  rules  which  have  no  other  rule  predecessors  have  been 
grouped  together  as  single  rules  and  rules  which  only  perform 
functions  such  as  displaying  text  and  pictures  or  writing  the 
report  have  been  omitted. 
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R22 

Deposition  forming  classical  bar 

C8,C16 

R23 

Bar  scour  indicates  greater 
channel  width 

C9 

R24 

Eroding  bar  may  be  migrating 

CIO 

R25 

Fines  indicate  that  bar  is  not 
migrating 

C10,C14 

R26 

Absence  of  fines  indicates 
migrating  bar 

CIO 

R27 

Vegetation  indicates  stable  bar 

CIO 

R28 

Sediment  transport  increasing 

C17 

R29 

Diffuse  gravel  sheets  exist 

C20,C23 

R30 

Decreasing  channel  width 
increases  slope 

C23,C17 

R31 

Increasing  channel  width 
decreases  slope 

C23,C17 

R32 

Increasing  slope  decreases 
meander  pattern 

C26,C21,C22 

R33 

Decreasing  slope  increases 
meander  pattern 

C21,C22,C26,C17 

R34 

Stable  channel  with  respect  to 
depth 

C6 

R35 

Imbrication  indicates  bed 
stability 

C6 

R36,R37 

Framework  or  matrix  gravel 
affects  depth  of  main  channel 

C6 

R38,R39 

Affect  of  sediment  transport  on 
depth 

C6  1 

R40 

Bar  erosion  implies  less  depth 

C6  1 

R41 

Narrowing  channel  increases  depth 

C6  1 
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A  table  (or  edge  adjacency  can  now  be  made  with  the 

hypothesis  conclusion  vertex  niimbers  from  the  right  colvimn  of 
the  conclusion  list  table  above  on  the  left.  On  the  right  is  put 
the  conclusion  list  vertex  ntmbers  from  the  left  hand  coliuon  of 
the  conclusion  list  table  above.  A  summary  of  the  information 
from  the  core  rules  in  the  expert  system  that  was  entered  into 
the  the  topological  sort  program  in  order  to  organize  its 
knowledge  base  is  given  below: 


C15  C3 

C15  ir>  C4 
Cll  =>  C5 
CIS  r:>  C5 
C7  C5 
Cll  C6 

CIS  C6 

C7  .^>  C6 
Cl  =>  C6 
C17  jr>  C6 
C14  C7 

C2  ->  C7 
C2  ->  C8 
C4  5r>  C8 
C17  =>  C8 
CIS  =>  C8 


C2  =r>  C9 
C2  ;r>  CIO 
CIS  ->  CIO 
Cll  r>  C12 
C2  =>  C14 
C2  r>  CIS 
C2  ->  C16 
C2  r>  Cll 
C4  C17 
C12  =>C17 
CS  ir>  C23 
C23  =>  C24 
C23  jr>  C26 


An  edge  adjacency  list  is  a  list  containing  all  the  end 
nodes  from  a  given  start  node  (or  vertex)  in  the  rule  bases' 
knowledge  tree.  This  is  a  general  term  to  designate  a  data 
structure  used  in  network  search  algorithms.  For  the  exeunple 
given  the  context  being  discussed  the  first  letters  (C)  from  each 
node  have  been  removed  and  a  list  made  of  all  connections, 
indexed  by  the  starting  node. 
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The  topological  sort  program  then  orders  the  rule  niunbers 
according  to  the  priorities  in  their  data  dependencies. 

After  the  information  from  the  conclusion  list  table  is  entered 
the  program  outputs  this  edge  adjacency  list: 


successor  list  for  rule  hypotheses 


vertex 

1 

[ 

6 

1 

vertex 

2 

[ 

7 

8 

vertex 

3 

[] 

vertex 

4 

[ 

8 

17 

] 

vertex 

5 

[ 

23 

1 

vertex 

6 

[] 

vertex 

7 

c 

5 

6 

1 

vertex 

8 

[] 

vertex 

9 

C] 

vertex 

10 

[] 

vertex 

11 

[ 

5 

6 

vertex 

12 

c 

17 

] 

vertex 

13 

[] 

vertex 

14 

[ 

7 

1 

vertex 

15 

c 

3 

4 

vertex 

16 

[] 

vertex 

17 

[ 

6 

8 

] 

vertex 

18 

Cl 

vertex 

19 

[] 

vertex 

20 

Cl 

vertex 

21 

[] 

vertex 

22 

Cl 

vertex 

23 

[ 

24 

26 

1 

vertex 

24 

[] 

vertex 

25 

[] 

vertex 

26 

[] 

9  10  14  15  6  11  ] 

12  ] 

5  6  8  10  ] 


Finally,  after  the  program  has  established  the  partial  order 
of  the  conclusions  by  their  data  dependencies,  the  rest  of  the 
rule  edges  in  the  edge  adjacency  list  at  the  beginning  of  the 
section  can  be  added  to  complete  the  knowledge  tree: 
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^-7  C3 


4- 

Cl  =>  C6 
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THEORETICAL  ALGORITHMS  FOR  SOLVING 
THE  ARMY  STATIONING  PROBLEM 

Janet  Hurst  Spoonamore,  Ph.D. 

U.S.  Army  Construction  Engineering  Research  Laboratory 
P.O.  Box  9005  Champaign,  U  61820-9005 
217-373-7267  email:  spoonam@C8rd.uiuc.edu 

Abstract.  This  research  addresses  algorithmic  approaches  for  solving  the  Army  stationing  problem.  The 
problem  is  formulated  as  an  assignment  problem  with  the  objective  function  being  a  minimization  problem. 
The  specific  assignment  problem  has  piece-wise  linear  additive  separable  server  cost  functions,  which  are 
continuous  everywhere  except  at  zero,  the  point  of  discontinuity  for  the  {0,1}  assignment  condition.  Con¬ 
tinuous  relaxation  of  the  {0, 1}  constraints  yields  a  linear  programming  problem.  Solving  the  dual  of  the 
linear  programming  problem  yields  the  complementarity  conditions  for  a  primal  solution,  a  system  of  linear 
inequalities  and  equalities.  Adding  equations  to  this  system  to  enforce  a  (0, 1}  solution  in  the  relaxed  solu¬ 
tion  set  yields  an  augmented  system,  not  necessarily  linear.  Methods  to  solve  this  S}'Stem,  a  system  of  linear 
inequalities  and  non-linear  equations,  in  a  least  square  sense  are  developed,  ext^ding  Han’s  method  for  solv¬ 
ing  linear  systems  of  inequalities.  Generalizations  of  these  methods  to  solve  general  systems  of  inequalities 
in  a  least  square  sense  are  developed.  A  sample  problem  is  shown. 

Key  words:  Assignment  problem,  least  square  problem,  stability,  global  convergence,  local  convergence, 
almost  everywhere  differentiable,  Clarke  subdifferential. 
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1.  FDrmulating  the  Stationing  Problem. 

Presently,  the  military  is  transistioning  from  the  Cold  War  era  to  the  new  “Power  Projection  Platforms” 
to  meet  the  global  challenges  of  the  21st  century.  Major  stationing  changes,  realignments  and  force  structure 
reductions  are  being  planned  and  executed.  At  the  same  time,  increasing  fiscal  constraints  require  attention 
to  cost  efficient  operations.  It  is  important  to  consider  the  least  cost  options  of  basing  units  among  all 
possible  installations.  By  formulating  an  assignment  problem,  which  is  based  on  cost  functions  of  different 
unit  types  at  the  various  installations,  one  could  determine  the  least  cost  stationing  alternative  among  all 
possible  alternatives. 

Let  i6/  =  {l,2,  ...,Tn}  represent  military  units  to  be  assigned  to  installations  j  €  J  =  {1,2,3,. ..,n}. 
Let  1'"  e  represent  the  unit  cvinand  vector  of  the  m  military  units.  For  each  j  €  J,  i  &  I,  let  j/ij  be 
the  relative  amount  of  demand  i,  assig'ie.l  to  installation  j,  i.e.,  y  e  /i"*”.  The  variable  y  is  constrained  so 
that  0  <  for  each  j  €  J,  i  €  I.  Further,  the  variable  y  must  meet  the  demand  for  each  t.  That  is,  for 
all  t  €  /, 

j€J 

Let  positive  linear  cost  coefficients,  dj,  and  positive  minimum  cost  constants,  be  associated  with  each 
installation  j  and  military  unit  i.  That  is,  assume  for  alH  €  /  and  j  6  J,  that  c  j  >  0  and  (l>j  >  0.  To 
simplify  notation,  interpret  the  vector  to  be  the  elements  of  y  €  3?'"”,  associated  with  installation  j. 
The  cost  function  for  each  installation  j  is: 

I  max(</>j ,  (c,,j ,  y,j ) )  if  any  yij  >  0, 

\0  ifyij  =0,V^  e/. 


This  installation  cost  function  represents  a  flat  minimum  charge  with  linear  rates  beyond  the  minimum. 

This  assignment  problem  belongs  to  a  family  of  problems,  called  the  uncapacitated  facility  location  prob¬ 
lem.  It  is  a  generalization  of  the  problem  described  in  C!onn  and  Cornuejols[1990].  See  Spoonamoref  1992] 
where  the  relaxation  of  the  problem  is  expressed  and  the  relaxed  dual  is  formed.  Conn  and  Comuejols[1990] 
show  solving  this  problem  by  solving  the  relaxed  dual  where  they  define  grekdients  for  determining  search 
directions.  The  relaxed  problem,  rather  than  being  solved  using  the  usual  linear  programming  methods,  can 
be  solved  by  expressing  the  problem  as  a  system  of  linear  equalities  emd  inequalities  and  solved  using  the 
method  developed  by  Han[1982]. 

Sample  Problem: 


In  order  to  illustrate  the  primal  and  dual  relationship,  the  following  simple  problem  is  shown.  Let 
m  =  4,  n  =  3;  let  0  =  (26, 30, 30);  let 

/20  25  25\ 

=  ®  ^  ®  1 
40  40  44  I  ■ 

\30  30  22/ 


no 


The  optimal  value  is  87'^,  achieved  at,  for  example,  t  =  (1,0, 1)  and  w  =  (1,0, 1)  and: 


0  0  0 


0  0  2^ 


0  0  0  ’  U  0  ^  • 


1  0  0 
0  0  1 


0  0  1 


5j  =  =  0,  and  S3  =  ^.  Thus,  ri  =  r2  =  1,  and  ra  =  -jy.  We  show  that  this  solution  satisfies  the 

optimality  conditions  (2.3.1). 

First,  consider  j  =  2,  where  yj,2  =  0  for  all  *.  satisfying  /§  =  and  =  0-  Further,  tua  =  0  is 

consistent,  since  J-  =  {1,2}  and 

(c.,2.2/»,2)  =  0  >  W2<l>2  =  0  *  30  =  0. 

Second,  consider  j  =  3.  /“  =  {1}  and  /“  =  {2,4}.  It  must  be  that  €  (0, 1)  and  =  {3}.  It  must 

be  that  ^4,3  =  1,  which  implies  that  W3  =  1  =  y2,3.  To  satisfy. 


{c.,3.y*,3)  =W3<I>3  ^  30, 


then  y3,3  = 


Lastly,  consider  j  =  1,  where  J®  =  {2)4}  and  /}“  =  0.  Since  1  6  J-,  then 


(c.,i,y..i)  =  57—  >  wi<t>i  =  26. 


yi.i  =  1  <  tui  =  1;  yi,3  =  55  <  wi. 


For  i  =  3,  53^  yzj  =  1.  Further,  for  all  i,  y<,j  =  1. 

The  full  set  of  solutions  of  this  problem  include:  yj_j  =  0,  except  as  follows: 


3/1,1  =Wi  =  l,  1/2,3  =  y4,3  =  1^3  =  1)  1/3,3  =  ^) 


and  tU2)  2/3,2)  and  satisfy: 
■W2  <  1) 


1/3,1  > 

41 

^3.2  =  ^  -  1/3.1’ 


0  <  y3,2  <W2<  21/3,2- 
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2.  Least  Square  Formulation  for  Solving  Systems  of  Inequalities. 

By  expressing  the  solution  to  a  linear  programming  problem  as  the  solution  to  a  system  of  inequalities  and 
equalities,  one  can  take  advantage  of  the  work  of  Han[1982],  Mangasari6m[1981}.  For  simplicity,  we  only 
consider  the  system  of  inequalities.  Firstly,  consider  linear  systems. 

2.1  Statement  of  Problem:  Consider  a  linear  transformation, 

:  3?"  »"*,  6  €  X  e  3?”. 


One  wishes  to  solve  for  x  e  3i",  such  that 


Ax  <  b. 


(2.1) 


The  least  square  formulation  of  (2.1)  yields 

mm/(x)  :=  |((Ac  -  6)+,  (At  -  fe)+),  (2.2) 

where  (Ac  —  6)+  denotes  the  projection  of  Ac  —  6  onto  31^. 

2.2  Numerical  Stability:  Robin8on[1975,1976]  shows  the  numerical  stability  properties  of  solution  sets  of 
systans  of  inequalities.  Let  (I  {x|  Ac  <  b}.  Define  (I  as  stable  if  for  all  Xo  €  -  ,  there  exists  0,6  sudi  that 
for  any  i4' :  31”  •->  31”*,  b'  €  Si*"  satisfying 


||A-A||  +  l|6-b'||<b, 


then 

dist  (xo,fl')  <  0p{xo), 

where 

fl'  :=  {x|  A'x  <  b'},  p(x)  :=  in^  ||b'  —  A'x  -  fc||. 

Define  the  system  v4x  <  b  as  regular  if  there  is  an  x  €  31”  such  that  Ax  <b. 

Robinson  shows  that  fl  being  stable  is  equivalent  to  the  system  Ax  <b  being  regular.  The  optimality 
conditions  for  a  solution  x,  z  which  solves 

inf  \{z,z) 

XfZ  h 

subject  to:  Ax  —  b  <  z 


are 


A^z  =  0, 
z  >0, 
Ac  —  b  —  z  <  0, 


(2.3) 


(z.  Ax  —  b  —  z)  =0. 

Thus,  Q  :=  {(x,  z)|  z  >  0,  Ax  —  b  —  z  <  0}  is  stable  since  the  system  z  >  0,  Ax  —  b  —  z  <  0  is  regular. 
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3.  Local  Convergence  Properties  of  Taylor  Series  based  on  Generalised  Differential  Constructs. 

In  this  section,  wc  show  that,  by  dcHninf'  {'encrali'/csl  differential  coastrijets  for  the  least  sf|iiare  objective 
function  and  the  projection,  one  can  take  wlvantage  of  stnuig  approxiinatini'  jirofMirties  in  the  'I'aylor  serkis 
expansion  based  on  these  constructs.  Most  importantly,  w«:  first  show  that  these  constructs  >ire  well  defiiKKl. 
Note  that  {Ax-b)+  juid  V/{x)  are  l>oth  Lipstdiitx  continuous  funcrtions  throughout  all  W".  'i'hus,  the  Clarke 
generalized  .Ituxibian,  i){Ax  —  6)+,  >uk1  generalizwl  Hes.si>ui,  i)Vf(x),  l>oth  exist  and  fue  well-fleHn*!<l.  Ib^:all 
that  J{x)  e  d(Ax  -  b)+  meaas: 

{ai  if  (ai,x}  -  fc,  X), 

Xia{  if  {Oi,x}  -bt=  0,  where  A,  6  [0, 1], 

0  if  (ai,x) -/>,<  0. 

Similarly,  H{x)  e  <>V/(x)  is  determined  by  choice  of  .7(x)  €  (){Ax  -  b)+  where  H{x)  =  .]'  {x).J(x).  S»k: 
Spoonamore[1992]  where  the  following  proposition  is  proven. 

Proposition:  Existence  of  an  Identification  Neighborhood. 

Let  X,  be  given  and  let 

/"(i.)  :=  =  k], 

/+{x,)  ;=  {i|  (a„x.)  >  M, 

/~(i.)  :=  {i|(ai,x,)  <  hi}. 

Then  there  exists  a  neighborhood  iV(i,,r)  of  i,,  such  that  for  x  €  N{x,,r): 

,  /«{x)  C  /“(x.), 

/+(x.)  = /+(x)\/“(x.),  (3.1) 

/-(x.)  =  /-(x)\/^x.). 

This  neighborhood  is  referred  to  as  the  identification  neighborhfxxl  of  the  point  x.  with  respect  to  the 
.system  of  inequalities  Ax  <  b.  The  approximating  properties  of  the  generalizeel  .lacobian  and  the  generalizeel 
Hessian  are  shown  in  the  following  propositions  which  are  developed  in  Sp<x>namore[1992]. 

Proposition.  Perfect  Approximating  Property  of  the  Generalized  .Jacobian. 

Let  X,  be  given  and  let  N{x„r)  be  a  neighborhood  which  satisfies  (3.1)  in  the  proposition,  above;.  Lf:t 
X  e  )V(i,,r)  and  let  p  :-  x,  -  x,  then  for  any  J(x)  €  d{Ax  -  h)+, 

(A(x  +p)  -  h)+  -  (Ax  -  h)+  -  .J{x)p  =  0. 

Proposition.  Perfect  Approximating  Property  of  the  Generalizerl  Hessian. 

Let  X,  be  given  and  let  N{x,,r)  be  an  identification  neighborhood  as  above.  Let  /(x)  :=  -  b)  +, 

(Ax  -  6)+).  Let  X  €  N(x,,r}  and  let  p  ;=  x.  -  x.  Then  for  any  ./(x)  €  fJ(Ax  -  b}  +  , 

fix  +  p)  -  fix)  -  ip,  Vfix))  -  i(p,  ./^'(x)./(x)p)  =  0. 
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4.  Ck>ncliision' 

In  Spoonamore{1992],  these  properties  are  used  to  develop  optimization  algorithms  which  parallel  the  existing 
algorithms  for  the  differentiable  case.  The  Army  stationing  problem  can  be  solved  by  formulating  the  problem 
as  an  assignment  problem  having  a  piece-wise  linear  objective  cost  function.  The  relaxation  of  the  problem 
into  a  linear  programming  problem  allows  solution  using  several  methods  including  the  method  based  on 
solving  systems  of  inequalities. 
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One  of  numerous  Army  applications  for  asset  management  is  to  move  a  network  of  transceivers 
belonging  to  friendly  forces  into  position  to  monitor  the  communications  of  a  similar  networic  (rf 
transceivers  managed  by  an  opponent.  Many  previous  attempts  to  solve  the  bounded  resource  allocation 
problem  have  been  excursions  into  the  realm  of  unconstrain^  optimization,  whereas  other  attempts 
have  framed  the  problem  as  one  of  constraint  satisfaction.  The  two  approaches  are  seemingly  at  odds 
with  each  other,  since  it  has  been  a  n\oot  point  about  whether  constraints  be  utilized  explicitly,  or 
incorporated  into  an  objective  function,  liiis  paper  attempts  to  resolve  the  conflict  by  showing  that  the 
two  approaches  are  complementary,  albeit  at  oppoate  ends  of  the  algorithmic  complexity  spectrum. 
When  no  constraints  are  available  to  control  tte  placement  of  k  transceivers,  the  diallenge  of  the 
problem  is  to  maximize  the  disjunction  of  k  fields  of  view,  and  it  is  shown  that  the  time  complexity  is 
exponential  in  the  number  of  transceivers.  When  the  locations  of  opposing  force  transceivers  are  known, 
together  with  the  radio  frequency  (RF)  propagation  graph,  the  problem  reduces  to  Knuth's  staUe 
nnarriage  theorem,  and  the  resultant  complexity  is  linear  in  the  number  of  transceivers.  In  addition  to  a 
characterization^of  the  computatioiiAl  complexity  of  the  problem,  two  other  results  have  enwrged  from 
this  research:  a  novel  strategy  to  allocate  rriays  and  transceivers  lo  the  hinges  of  invisiUe  areas;  and 
a  massively  parallel  architecture  to  compute  a  comprdwnsive  ensemble  of  field  of  view  bitmaps. 


Statement  of  the  problem. 

In  diverse  terrain,  when  using  a  network  of  friendly  transmitter/receiver  (transceiver)  devices 
to  collect  data  being  communicated  by  a  network  of  transceivers  controlled  by  an  adversary,  there  is  a 
dynamic  requirement  to  spatially  configure  the  network  in  such  a  way  that  friendly  transceivers  collect 
maximal  information  from  the  adversary  without  forsaking  the  ability  to  communicate  among 
themselves.  The  control  knowledge  to  guide  the  optimization  process  is  in  large  part  a  function  of  what 
one  can  cooperatively  see  or  hear  fmm  various  vantage  points  of  the  terrain,  which  fonrailates  a 
specialized  problem  in  resource  allocation.  The  allocation  process  may  or  may  not  be  facilitated  by  a 
variety  of  constraints,  depending  upon  their  availability.  If  few  constraints  present  themselves,  then 
the  problem  is  hard;  on  the  other  hand,  if  constraints  are  readily  availaUe,  then  the  problem  is  made 
correspondingly  simpler.  The  flip  side  of  the  collection  problem,  called  jamming,  is  relevant  when  it  is 
desired  to  deny  adversarial  communication,  rather  than  to  collect  it.  The  overall  problem  of  tasking  a 
bounded  number  of  communications  devices  to  collect  and/or  jam  adversarial  communications  is  called 
the  Intelligence  Electronic  Warfare  (lEW)  asset  management  problem.  In  the  discussion  below,  the 
word  "resource"  or  "asset"  refers  to  either  a  transceiver  or  a  jammer,  whether  it  be  ground-based  or 
airborne. 


The  issues  of  line  of  sight  and  field  of  view. 

Une  of  sight  (LOS)  is  a  concept  which  distinguishes  between  what  parts  of  a  map  are  visible 
and  invisible  along  a  given  line  to  an  observer  located  at  a  specific  vantage  point.  Field  of  zriew  (FOV), 
also  called  area  coverage  (AC),  is  the  union  of  all  lines  of  sight  radiating  outward  from  a  specific 
vantage  point.  For  either  the  constrained  or  unconstrained  transceiver  placement  problem,  LOS  and 
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POV  are  fundamental  operations.  Within  the  De|7artment  of  Defense,  thoe  has  been  a  proliferation  of 
line  of  sight  algorithms,  which  has  recently  prompted  a  standardization  study  [Jl].  There  are  two 
types  of  line  of  sight  algorithms:  optical  and  electronic.  The  optical  version  renders  a  field  of  view 
display  based  on  what  is  visiMe  to  a  rangteless  optical  sensor.  Vegetation,  among  other  factors,  may 
cause  seasonal  variations.  The  electronic  version  addresses  a  harder  problem  in  that  it  takes  into 
account  radio  frequeiKy  (RF)  propagation  and  associated  power  loss.  It  too,  is  affected  by  vegetation 
and  by  other  factors,  such  as  the  attenuating  effects  of  a  rough  vs.  smooth  earth,  troposcatter, 
diffraction,  reflection,  soil  conductivity,  and  solar  flux.  One  of  the  more  sophisticated  electronic  line  of 
sight  algorithms  is  the  Terrain  Integrated  Rough  Earth  Model  (TIREM)  [SlJ. 

In  Figure  1,  a  simulated  terrain  and  corresponding  topographic  map  are  depicted  to  illustrate 
optical  line  of  sight  concepts.  In  the  topographic  map,  light-colored  areas  are  at  Mgher  elevation. 

The  terrain  consists  of  hilltops  A  and  B  connected  by  saddle  E,  which  forms  a  divide  between  valleys  C 
and  D.  Vantage  point  C  offers  vistas  of  hills  A  and  B,  and  saddle  E,  but  does  not  provide  views  beyond. 
The  field  of  view  bitmap  from  point  C,  denoted  Xc,  is  characterized  by  the  diagonal  line  running  from 
the  lower  left  comer  of  the  topographic  map  to  the  upper  right  comer,  portrayed  at  the  left  of  Figure  2. 
Visible  areas  are  colored  white,  whereas  invisible  areas  are  gray.  From  C,  one  cannot  see  over  into 
valley  D,  nor  from  D  can  one  see  into  valley  C.  Note  that  vantage  point  D's  line-of-sight  bitmap,  Xq/  is 
the  comptement  of  Cs.  It  is  important  to  observe  that  taken  together  (the  set  union  Xc  U  Xq), 
oond>iii^  vista  offered  by  C  and  D  is  as  oomprehenstve  as  that  offered  by  points  A,  B,  or  E  separately. 


Figure  1.  A  simulated  tenain,  and  corresponding  topographic  line  map. 


Figure  2.  Optical  field  of  view  bitmaps  from  valleys  C  and  D  respectively  (white  is  visible). 
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All  field  of  view  algorithms  ciurrently  under  develqpinent  by  the  Department  (rf  Defense  utilize 
Digital  Terrain  Elevation  Data  (DTED)  as  input  to  generate  fidd  of  view  bitmaps.  DTED,  produced  by 
the  Defense  Mapping  Agency  (DMA),  is  a  multi-megabyte  gridded  database.  A  gridded  database  is  a 
discrete  representatitm  of  a  continuous  terrain,  where  an  devation  value  is  associated  with  each  grid 
cell  of  the  map  of  intoest.  In  Figure  3,  the  topografMc  map  seen  eariier  has  been  overlaid  with  a  ten 
^  ten  cdl  grid.  Some  <rf  the  grid  odb  have  been  hbded  with  Hieir  respective  devation  values  to 
illustrate  a  gridded  database.  Two  different  grid  lesdutions  are  currently  availaUe  from  the  Defense 
Mapping  Agency:  DTED  Level  1,  representing  one  hundred  meter  horizontal  spacii^  and  DTED  Levd 
2,  with  thirty  meter  spacing.  DTED  Level  2  provides  a  finer  resolution  product,  but  the  computer 
memory  overhead  is  more  prohibitive. 


Figure  3.  A  gridded  elevation  database. 


When  constructing  a  field  of  view  bitmap,  any  algorithm  which  accesses  each  element  of  the 
digital  grid  is  said  to  be  an  exact  algorithm.  With  current  technology,  the  time  complexity  required  to 
produce  an  exact  optical  field  of  view  is  O  ( n^  ],  where  n  is  the  number  of  sampled  elevation  values 
along  one  edge  of  the  DTED  database.  If  some  elements  of  the  digital  grid  are  ignored  or  bypassed 
during  processing,  then  the  algorithm  is  said  to  be  approximate.  There  are  a  variety  of  algorithms  of 
time  complexity  O  ( n^  ]  which  produce  an  approximate  field  of  view;  for  examples,  refer  to  [Bl,  Rl]. 
The  Broome  algorithm,  which  utilizes  a  moving  horizon  technique,  was  the  first  to  achieve  O  [  n^  ]  time 
complexity.  Empirical  data  indicate  that  the  Ray  algorithm  produces  output  matching  the  exact  with 
high  probability  [R2].  On  a  single  processor,  it  is  easy  to  see  that  the  time  complexity  to  produce  a  set 
of  exact  field  of  view  bitmaps  for  an  n  x  n  gridded  database  is  O  (  n®  ],  while  the  time  complexity  to 
produce  a  set  of  approximate  field  of  view  bitmaps  is  O I  n*  ]. 

Electronic  line  of  sight  algorithms  combine  power  loss  with  terrain  to  produce  a  set  of  power 
contours  emanating  outward  from  a  sensor  placement  position.  Generally,  electronic  line  of  sight  is  more 
encompassing  than  optical  line  of  sight,  because  electronic  signals  can  be  heard  in  places  which  cannot 
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be  seen.  For  the  transcdver  allocation  proUem,  it  is  important  that  two  potential  transceiver  sites  be 
within  die  same  power  contour  at  a  specified  signal-to-noise-level.  If  a  transmitter  lies  beyond  the 
contour,  it  cannot  be  heard  by  another  transceiver.  Also,  if  aU  odio*  transceivers  lie  outside  a  power 
contour  corresponding  to  a  spedBc  transceiver  location,  thm  the  transceiver  wiU  not  be  aUe  to 
communicate  St  that  power  levd  with  neigl^ring  transceivers. 


Global  visibility  and  the  unconstrained  problem. 

For  asset  management  applications,  it  is  useful  to  quantize  field  of  view  by  devising  a  metric 
which  indicates  global  visibUity  of  the  bitmap.  If  there  are  w  visible  grid  cells  on  an  n  x  n  bitmap  X, 
then  the  visibility  ratio  of  the  Utmap,  denot^  v,(X),  is  defined  to  be: 

v,(X)  =  w/n^  [1] 


The  visibUity  ratio  is  a  metric  useful  for  transceiver  allocation  during  regimes  of  unconstrained 
optimization,  when  fields  of  view  from  several  vantage  points  must  be  cooperatively  combined  to 
a^ieve  maximal  global  visUiUity.  When  the  aUocation  process  cannot  avail  itself  of  the  search 
reduction  afforded  by  constraint  satisfaction,  then  it  must  resort  to  an  unconstrained  c^timization 
scheme.  In  the  unconstrained  transceiver  placement  problem,  the  following  objective  function  must  be 
maximized,  where  Xi  is  the  field  of  view  Utmapcorre^nding  to  the  i***  grid  cell  (where  i  is 
incremented  from  left  to  r^t,  ttien  donvn),  and  k  is  the  number  of  transceivers  allocated: 

k 

Vj  [  U  A,,  1,  where  A,i  =  X.J,  1  j  S  n*  [2] 

i-l 

Theorem.  For  the  unccmstralned  transceiver  allocation  problem,  the  time  complexity  required  to 
optimally  place  k  transceivers  is  O  [  n^  1,  where  n  is  the  number  of  grid  cells  along  one  edg^of  the 
gridded  elevation  database. 


Proof.  On  a  gridded  database  witti  dimensions  n-x  n,  the  number  of  ways  to  place  k  transceivers  is  the 
combination  ^  n^  ob)^  grouped  k  at  a  time: 

) 


k(k-l)-l 


[3] 


This  result  is  discouraging,  since  it  indicates  Uiat  as  the  number  of  transceivers  to  be  allocated 
increases,  the  time  required  to  maximize  the  objective  function  increases  exponentiaUy  in  the  number  of 
transceivers.  An  exact  solution  to  the  unconstrained  problem  is  therefore  intractable.  To  illustrate, 
consider  the  case  of  placing  three  transceivers  on  a  DTED  level  2  grid.  For  the  Killeen,  Texas  map,  the 
DTED  grid  dimensions  are  901  x  901 .  Appealing  to  the  theorem,  the  time  complexity  required  to  place 
three  transceivers  on  the  grid  is  0 1 901*  ]  ~  0 1 10’*  ].  For  each  of  the  10’*  configurations  of  transceivers, 
the  objective  function  at  [2]  must  be  computed  and  compared. 

To  be  practical,  one  must  resort  to  heuristic  techniques  to  obtain  an  approximate  solution.  One 
such  technique  is  simulated  aimealing  [KIJ.  Simulated  annealing  is  a  computer  optimization  technique 
which  models  the  annealing  process  of  metallurgy,  in  which  a  physically  stressed  metal  is  first 
heated,  and  then  cooled  at  a  certain  rate  to  produce  a  stronger  metal.  The  strongest  such  metal 
obtainable  by  the  annealing  process  corresp>onds  to  the  global  maximum  of  the  objective  function 
formulated  for  a  specific  optimization  problem. 
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The  Sensor  Placement  Analyzer. 


Emulated  annealing  has  recently  been  successfully  adafMed  to  the  bounded  resource  allocation 
problem,  in  a  ^stem  called  the  Sensor  Placennent  Analyzer  (SPA)  [PI].  The  SPA  work  was  sponsored  by 
the  Department  of  the  Army  with  Small  Business  Innovative  Research  (SBIR)  funding.  The  olqective 
of  SPA  is  to  maximize  an  objective  function  corresponding  to  global  visiUlity  (with  respect  to  optical 
field  of  view)  among  cooperative  sensor  resources.  SPA  as  originally  implemented  was  adapted  to  a 
124XXX)  scale  USGS  map,  rqxesenting  a  30km  x  30km  section  of  Madison  County,  Virginia.  Because  of 
the  prohibitive  numbo*  of  sensor  placement  locations  required  to  address  Ae  unconstrained  fmiblem,  the 
SPA  implementors  elected  to  identify  the  highest  elevation  cell  within  each  of  nine  hundred  grid  cells 
(each  one  km  on  a  side),  and  restrict  sensor  placonent  to  these  local  maxima.  The  maxima  were 
extracted  by  hand-selecting  contour  data,  while  utilizing  a  digitizing  tablet  georeferenced  with  a 
scanned  map  image  of  Madison  County.  For  each  of  the  nine  hundred  local  maxima,  a  field  of  view 
bitmap  was  precomputed,  depicting  area  coverage  from  the  site.  To  constrain  the  sensor  placonent 
problem,  a  user  is  able  to  interactively  create  polygonal  regions  corresponding  to  both  red  (adversarial) 
and  blue  (frioidly)  areas  of  interest,  which  govern  respectively  where  red  or  blue  sensors  may  be 
placed.  The  user  is  also  able  to  specify  with  a  pull-down  menu  which  of  two  optimization  regimes  to 
run  -  a  simulated  annealing  version,  or  a  faster  locally  constrained  scheme.  Another  constraint  on  the 
system  allows  the  user  to  specify  whether  radio  netting  constraints  are  to  be  obeyed  when  placing 
sensors.  Although  there  are  no  benchmark  ground  truth  data  available  to  test  the  quality  of  the 
system's  performance,  it  is  readily  apparent  to  an  observer  that  excellent  solutions  are  derived  by  SPA. 

There  are  some  limitations  to  SPA,  as  originally  implenrtented.  One  is  confinement  of  sensor 
placement  to  local  maxinui,  which  implies  that  field  of  view  from  local  minima  (valleys)  is  not 
considered.  A  new  version  of  SPA,  tentatively  called  the  general  placement  analyzer  (GPA),  addresses 
this  limitation  [B2].  Another  limitation  suffered  by  any  algorithm  utilizing  simulated  annealing  is 
the  development  of  an  annealing  schedule  which  lends  itself  to  both  high  performance  and  an 
admissible  solution  (i.e.,  an  optimal  solution).  The  primary  complaint  about  simulated  annealing  is 
usually  directed  at  its  slowness  to  converge.  GPA  plans  to  overcome  this  limitation  by  utilizing  as  many  . 
domain-specific  constraints  as  possible  to  reduce  the  search  space  to  a  minimal  covering  set  of  potential 
sensor  sites.  One  constraint  wluch  offers  good  leverage  specifies  that  a  collection  asset  such  as  a 
transceiver  must  be  located  probabilistically  within  a  relatively  tight  spatial  bound  about  a  roadway. 


Exploiting  the  fringes  of  invisible  areas  to  fadlitate  combined  field  of  view. 

Transceiver  placement  algorithms,  even  when  equipj^  with  field  of  view  bitmaps,  are 
frequently  in  a  quandary  regarding  where  to  place  a  second  transceiver  after  one  transcdver  has  been 
placed  and  its  field  of  view  displayed.  The  author  suggests  the  following  technique  -  place  the  second 
transceiver  upon  the  edge  of  an  invisible  •  rea.  The  edges  of  invisible  areas  are  often  ridges  or  lips  of 
depressions,  offering  excellent  vistas  of  regions  not  visible  from  the  first  transceiver  location.  The  order 
of  placement  is  an  open  problem.  If  one  adopts  a  greedy  technique,  which  is  not  guaranteed  to  produce 
an  optimal  solution,  one  can  select  the  field  of  view  bitmap  with  greatest  visibility  ratio  as  a  good  site 
for  the  first  transceiver.  To  facilitate  cooperative  line  of  sight,  one  then  moves  another  transceiver  to 
the  fringe  of  the  largest  invisible  region.  An  alternative  is  to  move  a  relay  to  the  near  fringe  and  a 
transceiver  to  the  far  fringe.  This  logic  is  iterated  until  the  supply  of  transceivers  is  exhausted,  until 
the  summed  visibility  ratio  approaches  one,  or  until  it  is  observed  that  no  improvement  is  forthcoming. 
In  Figure  4,  if  one  transceiver  is  placed  at  site  T,  then  other  transceivers  and  relays  may  be  placed  as 
shown  to  enhance  combined  field  of  view.  The  fringe  exploitation  technique  does  have  limitations, 
which  include:  an  invisible  area  may  be  within  hostile  territory,  there  may  not  be  easy  transport  to 
the  fringe  area,  or  the  fringe  area  may  reside  beyond  electronic  line  of  sight  of  the  first  location. 
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Figure  4.  Assigning  relays  and  transceivers  to  fringes  of  invisible  areas. 


Constraints  on  lEW  asset  management 

The  constraint  set  for  lEW  asset  maiuigement  consists  of  a  variety  of  databases,  some  static  and 
some  dynamic.  Static  databases  indude:  a  table  of  the  performance  characteristics  of  the  transmitters 
and  the  receivers  under  consideration;  a  set  of  feastUe  collection  sites  baaed  on  Digital  Terrain 
Elevation  Data  (DTED);  and  a  database  comprised  of  die  Defense  Mapping  Agency's  diemadc  vector 
overlay  Tactical  Terrain  Data  (TTD)  [Ml].  The  latter  database  is  projected  to  contain  a  set  of 
topographic  contours,  widi  contour  intervals  ^ledfied  by  the  map  resolution  (e.g.,  a  150XX)0  scale  nu^ 
uses  a  10  meter  contour  interval).  Also  important  are  the  optical  and  electronic  line  of  si^t  databases, 
which  encode  fields  of  view  finm  arbitrary  locations.  These  databases  are  normally  computed  in  a 
preprocessing  step,  which  consumes  quintic  processing  time  and  quaitic  memory  to  p^uce  n^  exact  fields 
of  view.  Dynamic  databases  indude  (but  are  not  limited  to):  a  user-provided  representation  of  the 
forward  edge  of  battlefield  activity  (FEBA)  and  forward  line  of  troops  (FLCXT);  phase  lines 
corresponding  to  the  division  area  of  interest  (DAI)  and  named  areas  of  interest  (NAI);  no-go  or  slow-go 
areas  represented  by  a  modified  combined  obstacles  overlay  (MCOO). 


Tnuuceivcr  placement  confined  to  local  elevation  extiema. 

Because  of  tiie  large  number  of  potential  transceiver  locations  on  a  map  n  grid  cells  on  a  side, 
many  approaches  to  generation  of  Add  of  view,  induding  the  original  verdon  of  tiie  Sensor  Placement 
Analyzer  described  above,  espouse  selecting  potential  sensor  sites  based  on  local  elevation  maxima. 
Therefore,  the  local  maxima  of  the  DTED  database  are  suitable  candidate  locations  for  transceiver 
placement.  Relatively  high  elevations  generally  provide  high  visibility  vistas,  making  this  a 
powerful  technique.  However,  low  spots  of  the  terrain  are  not  accommodated,  so  it  is  not  posdble  to 
develop  a  field  of  view  from  a  river  valley.  It  has  been  observed  that  certain  river  valleys,  such  as  the 
Chosin  reservoir  valley  in  Korea,  offer  relatively  wide  fields  of  view  [Rl].  It  is  also  true  that  in 
mountainous  terrain,  tiiere  are  many  more  roads  in  tiw  valleys  and  passes  than  on  the  peaks 
themselves,  and  transceivers  are  frequently  deployed  within  a  relatively  tight  spatial  neighborhood 
about  a  road.  As  a  final  note,  there  are  many  sites  on  a  map  which  are  not  local  maxima  or  minima,  yet 
still  provide  good  directional  vistas.  Shelves  or  ledges  on  mountain  sides  frequently  offer  excellent  one 
hundred  eighty  degree  vistas  of  a  terrain,  but  are  overlooked  during  the  automated  site  selection 
process.  When  looking  for  adversarial  transmitters,  one  needs  good  field  of  view  in  the  direction  of  the 
FEBA.  One  research  effort  has  devised  a  coIor<oding  scheme  to  signify  the  relative  visibility  of 
terrain  to  a  user,  based  on  the  user's  requirements  [Rl]. 
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FOiering  out  sites  which  are  too  near  map  features. 


To  minimize  interference  and  to  avoid  detection.  Army  doctrine  qxdfies  dwit  communicatkms 
equiiHnent  be  stood  off  specified  distances  from  map  features  sudi  as  road^  streams,  and  power  lines.  If 
a  cai^idate  transceiva*  location  is  too  near  a  map  feature,  then  it  may  be  removed  from  the  candidate 
list  of  transceiver  locations,  at  the  discretion  of  the  analyst.  An  alternative  is  to  porturb  the  location 
slightly,  until  a  new  site  is  goierated  to  meet  the  requirnnent.  This  latter  action,  however,  often 
relocates  the  site  to  a  lower  elevation,  whoe  it  may  not  be  appro{Mriate  to  place  a  transceiver  because 
of  poor  field  of  view. 

Previous  research  at  the  Intelligence  Electronic  Warfare  Directorate  resulted  in  a  set  of  high- 
performance  algorithms  designed  to  compute  the  distance  from  a  query  point  to  a  set  of  map  features 
[Cl].  These  or  similar  al^rithms  may  be  utilized  in  a  preprocessing  step  to  remove  those  candidate 
sites  deemed  too  near  map  features.  Alternatively,  if  one  desires  to  keep  the  sites,  the  failed 
proximity  constraints  notwithstanding,  then  each  site  may  be  tagged  to  indicate  noncompliance  with 
doctrinal  proximity  specifications. 


The  division  aiea  of  interest  and  forward  edge  of  battlefield  activity  constraints. 

During  planning  operations,  an  Army  division  is  assigned  a  division  area  of  interest  (DAI). 
Doctrinally,  a  DAI  is  a  rectangular  region  of  about  20km  X  30m,  although  the  shape  and  dimensions  are 
dependent  upon  map  context.  For  the  asset  managenwnt  problem,  the  DAI  frontage  boundary  may  be 
used  to  filter  potential  transceiver  placement  locations.  The  DAI  frontage  boundary  may  be  viewed  as 
a  line  segment  which  segregates  potential  red  transceiver  locations  from  blue. 

During  cortflict,  there  is  sometinres  an  implicit  hypothetical  boundary  visualized  or  drawn  by 
an  analyst  between  the  frontages  of  friendly  and  opposing  forces.  This  boundary,  called  the  forward 
edge  of  battlefield  activity  (FEBA),  imposes  yet  anodter  constraint  on  friendly  transceiver  placement. 
The  FEBA  may  be  perceived  as  a  context-sensitive  perturbation  of  the  front  edge  of  the  DAI. 

Geirerally,  it  is  ill-advised  to  place  a  transceiver  on  the  side  of  the  FEBA  occupied  by  an  opponent,  so 
that  oiUy  those  candidate  sites  in  friendly  tnritory  need  be  cmisiderBd.  Onoe  again,  an  analyst  may 
override  the  DAI  ot  FEBA  restrictions  in  the  same  way  he  may  override  a  proximity  constraint,  but  in 
most  cases  he  will  find  it  prudent  to  allow  the  coitstraint  to  stand.  If  he  does  relax  the  constraint  and 
allows  the  sites  to  persist  in  the  database,  then  the  asset  maiagemertt  system  may  tag  the  sites  as 
having  failed  the  DAI  or  FEBA  constraints. 


Computing  the  hearability  graph. 

Once  the  proximity,  division  area  of  interest,  and  forward  edge  of  battlefield  activity 
constraints  (given  they  are  available)  have  been  exploited,  the  list  of  candidate  transceiver  placement 
sites  may  be  subjected  to  constraints  imposed  by  radio  fivquency  (RF)  power  propagation  modeling.  RF 
line  of  sight  combines  power  loss  with  optical  line  of  sight.  Using  RF  propagation  models,  one  may 
determine  if  a  candidate  site  is  within  a  specified  power  contour  radially  produced  from  another 
location.  A  power  contour  is  derived  by  performing  an  RF  line  of  sight  computation  from  one  location  to 
another,  and  the  contour  is  labeled  with  the  signal  to  noise  ratio,  expressed  in  decibels  (dB),  of  a 
hypothetical  communication.  Runiung  the  RF  propagation  model  over  all  sites  in  the  candidate  list 
produces  a  graph.  The  nodes  in  the  graph  are  candidate  site  selections,  and  an  edge  represents  certitude 
that  a  node  can  hear  another  node  at  a  specified  power  level.  Observe  that  a  node  may  represent  an 
entire  region  of  RF  reception,  and  is  not  necessarily  just  one  point  on  the  map.  If  two  nodes  are  connected 
by  an  edge,  then  they  can  communicate  with  each  other  at  a  specific  power  level,  which  means  that  one 
can  hear  the  conununications  of  the  other.  Conversely,  the  lack  of  an  edge  between  nodes  indicates  that 
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there  is  no  netwcnrk  capability  between  them.  For  descriptive  purposes,  we  call  the  resultant  connected 
graph  a  transceiver  heaiat^ty  graph.  In  one  such  graf^  at  Hgure  5,  potential  site  locations  are 
linked  to  ottter  hearable  locations,  at  various  power  levels. 


Figure  5.  A  transceiver  hearability  graph. 


Uninfonned  surveillance. 

If  there  is  no  information  available  concemii^  the  locations  of  red  transceivers,  then  blue  forces 
can  adopt  one  of  a  variety  of  strategies  to  become  infom^.  One  strategy  is  to  place  k  transcdvers  on 
the  hearability  graph  at  roughly  equal  distances  across  the  frontage  of  the  DAI,  while  utilizing 
transceivers  in  receiver  mode.  This  is  somewhat  analogous  to  a  zone  defense  employed  by  professional 
football  teams.  Another  strategy  is  to  use  an  unconstrained  optimization  technique  such  as  simulated 
annealing  to  find  foe  best  optical  field  of  view  locations  for  blue  transceivers,  place  transcdvers  at 
those  locations,  and  trust  that  red  transceivers  do  not  deploy  in  invisible  areas.  Yet  another  strategy  is 
to  go  into  surveillance  mode  with  the  transceiver  capable  of  acquiring  the  best  field  of  view,  usually  an 
airborne  platform,  and  scan  the  DAI  until  activity  is  detected.  Once  activity  is  detected,  blue  forces 
may  begin  to  de(^y  transceivers  to  populate  the  hearability  graph,  motivated  by  the  requirement  to 
monitor  red  activity. 


Hypothesizing  a  communications  network  of  opposing  forces. 

With  foe  DAI,  FEBA,  hearability  graph,  and  other  exploitable  constraints  in  hand,  one  nnay 
proceed  to  h)qx)thesize  the  deployment  of  a  communications  network  managed  by  opposing  forces.  If  an 
opponent  is  re^x)nsible  for  managing  k  transceivers,  then  the  k  devices  must  populate  the  hearability 
graph  in  such  a  way  as  to  define  a  network  on  the  hostile  side  of  foe  FEBA.  No  single  red  transceiver  is 
permitted  to  be  outside  foe  communications  range  of  every  other  red  transceiver.  Another  way  of 
expressing  this  is  that  at  least  one  other  transceiver  must  be  netted  to  a  given  transceiver. 

The  suspected  locations,  if  they  are  available,  of  opposing  force  transceivers,  are  plotted  on  the 
hearability  graph.  If  a  suspected  location  does  not  correspond  to  a  node  of  the  hearability  graph,  then 
it  may  be  installed  into  the  graph  at  this  time,  its  RF  power  propagation  contours  computed,  and  its 
power  links  connected  to  hearable  red  transceivers.  In  Figure  6,  locations  of  three  red  transceivers  have 
been  hypothesized,  indicated  by  the  shaded  nodes.  The  transceivers  have  been  labeled  RL,  RC,  and 
RR,  which  represent  red's  left,  center,  and  right  respectively.  Note  that  RL  is  netted  to  RC,  which  is  in 
turn  netted  to  RR. 
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Figiure  6.  A  simulated  red  communications  network  depk^ment 


Informed  surveillance  and  parity  matching. 

If  and  when  susf>ected  or  hypothetical  locations  of  red  transceivers  become  available,  due  to 
information  gathered  during  the  intelligence  cycle,  it  becomes  worthwhile  to  attempt  to  identify 
optimal  collection  sites  for  blue  transceiver  placement.  These  sites  must  obey  the  RF  propagation  power 
constraints  imposed  by  the  hearability  graph. 

Parity  matching  is  used  to  implement  a  one-on-one  assignment  of  blue  to  red  transceivers,  with 
an  accompanying  abandonment  of  the  zone  defense  occasioned  by  uninformed  surveillance.  Parity 
matching  is  a  technique  to  counter  every  located  adversarial  transceiver  with  a  friendly  one.  The 
surveillance  regime  transfers  from  uninformed  to  iirformed.  The  underlying  assumption  is  that  for  every 
red  transceiver  deployed,  there  is  a  corresponding  blue  transceiver  available  to  counter  it.  In  Figure  7, 
it  has  been  determine  that  to  cover  red's  center  transceiver  effectively,  there  are  two  possible 
locations,  BCl  and  BC2,  for  blue's  center  transceiver.  It  is  computationally  nnore  efficient  to  begin  the 
allocation  process  with  blue's  center  rather  than  the  extremes  of  the  network. 


Figure?.  Parity  matching:  lining  up  blue's  center  with  red's  center. 


It  is  now  feasible  to  develop  the  topology  (graph-theoretic  deployment)  of  blue's  network.  If 
location  BCl  is  selected  to  be  the  location  of  blue's  center,  then  there  are  two  possible  locations 
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(connected  to  red's  left)  for  blue's  left,  but  only  one  for  blue's  right.  Thus  there  are  two  feasible 
transceiver  network  deployments  using  site  BCl  as  blue's  center  (Fig.  8).  For  blue's  left,  one  might  select 
one  of  the  two  available  locations  BLl  or  BL2  based  on  stronger  signal-to-noise  ratio,  better  logistics 
support,  or  a  variety  of  other  operational  constraints.  Note  that  with  this  topology,  blue's  left 
provides  redundant  coverage,  since  blue's  center  alone  is  capable  of  monitoring  both  red's  left  and  center. 


Figure  8.  Possible  locations  for  blue's  flanks,  given  BCl  center  selection. 

If  site  BC2  is  selected  as  blue's  center  location,  then  there  is  a  unique  allocation  strategy,  which 
puts  blue's  left  at  BL  and  blue's  right  at  BR  (Fif  9).  In  this  instance,  blue's  right  may  be  detached  from 
the  network  since  blue's  center  transceiver  covers  both  red's  center  and  right.  When  this  allocation  is 
added  to  the  two  found  previously,  it  is  seen  that  there  are  three  feasible  transceiver  allocation 
strategies  for  this  small  example.  In  practice,  for  large  scale  problems,  there  are  likely  to  be  many 
allocation  solutions  which  satisfy  a  commander’s  transceiver  allocation  requirement. 


Figure  9.  Possible  locations  for  blue's  flanks,  given  BC2  center  selection. 

Totally  constrained  transceiver  allocation  and  the  stable  marriage  problem. 

In  this  section,  the  algorithmic  complexity  of  the  totally  constrained  transceiver  allocation 
problem  is  discussed.  The  allocation  problem  is  totally  constrained  when  both  the  hearability  graph  is 
available  and  the  locations  of  adversarial  transceivers  are  known.  It  will  be  s.  ^n  that  these 
constraints  make  the  allocation  problem  much  simpler  than  unconstrained  opti.iiization.  However,  it 
should  be  emphasized  that  locational  constraints  are  available  only  infrequently,  during  regimes  of 


infonned  surveillance. 


Theorem.  The  totally  constrained  transceiver  allocation  problem  is  equivalent  to  the  stable  marriage 
problem. 

Proof:  In  the  stable  marriage  problem  (K2],  it  is  required  to  match  k  bachelors  with  k  maidens  in  sudi  a 
way  that  the  resulting  marriages  are  stable.  By  "stable",  it  is  meant  that  there  do  not  exist  "more 
suitable"  nrarriages  between  spouses  which  mi^t  cause  the  given  set  of  ituuriages  to  dissolve.  To 
illustrate,  two  couples  who  prefer  their  own  spouses  to  those  of  another  couple  have  stable  marriages. 
Turning  to  the  transceiver  allocation  problem,  it  may  be  construed  that  blue  receivers  corre^nd  to 
bachelors,  red  transmitters  to  nuidens,  and  signal-to-noise-ratio  between  blue  and  red  to  the  marriage 
"bond".  It  is  possible  to  stabilize  an  unstable  transceiver  allocation  by  requesting  two  blue  transceivers 
to  swap  surveillances.  The  allocation  of  Figure  10  (matching  is  bold)  is  unstable,  since  it  is  possible  to 
find  another  matching  with  greater  signal-to-noise  ratio  (the  50  and  40  dB  links).  To  stabilize  the 
allocation,  one  simply  needs  to  detach  the  crossing  20  dB  links  and  enable  the  50  and  40  dB  links. 


Theorem.  The  totally  constrained  bounded  resoiuxe  problem  is  solvable  in  linear  time. 

Proof:  This  follows  immediately,  since  the  stable  marriage  problem  is  solvable  in  linear  time  [521. 


Unconstrained  vs.  constrained  optimization:  computing  the  savings. 

It  was  shown  above  that  in  an  unconstrained  application  the  time  complexity  to  place  k 
transceivers  on  a  gridded  map  of  dimension  n  is  O  [  n^  ].  It  was  also  shown  that  for  a  totally  constrained 
application,  the  time  complexity  is  O  [  k  ].  The  savings  in  performance  is  the  difference  between  these 
two  functions.  As  the  number  of  transceivers  grows,  the  total  savings  is  the  integral  of  the  difference  of 
the  functions.  The  savings  itself  is  exponential  in  complexity,  as  represented  at  equation  [7]  below. 


t  =  +  bi 

[4] 

t  =  Ejk  +b2 

[5] 

A=J  (a,n^''  +  b,  -Ejk-  bj  )dk 

[6] 

[7] 

Inn  2 
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Specificity  of  constraints  and  algorithmic  complexity. 

It  has  been  demonstrated  that  in  an  unconstrained  transceiver  placement  application,  the  time 
complexity  to  solve  the  allocation  problem  is  exponential  in  the  number  of  transceivers,  while  in  a 
strongly  constrained  regime,  the  complexity  is  linear  in  the  number  of  transceivers.  Although  open 
problems  remain  for  sets  of  weaker  constraints  which  lie  somewhere  between  these  extrema  (see 
tabulation  below),  it  is  surmised  that  the  time  complexity  for  the  weakly-constrained  problem  is 
bounded  between  exponential  and  linear.  For  example,  when  only  the  hearability  graph  is  available, 
but  the  locations  of  adversarial  transmitters  are  unknown,  the  domain  is  similar  to  the  quadratic 
matching  problem,  which  has  time  complexity  O  [  n^  ]. 


UNCONSTRAINED  OPTIMIZATION  O  [  n^k  ] 


1.  Transceiver  sites  permitted  anywhere  on  map 


2, 


Transceiver  sites  confined  to  local  maxima 
Filter  sites  based  on  local  maxima 


3.  Proximity  to  interfering  map  features  is  known 

Filter  sites  based  on  proximity/inclusion 

4.  Division  area  of  interest  (DAI)  known 

Filter  sites  based  on  DAI  frontage 


5.  Forward  edge  battlefield  activity  (FEBA)  known 

Filter  sites  based  on  FEBA  frontage 

6.  Optical  line  of  sight  bitmaps  available  for  some  lo<ations 

7.  Electronic  line  of  sight  bitmaps  available  for  some  locations 

Compute  partial  hearability  graph 

8.  Optical  Hire  of  sight  bitmaps  available  for  all  locations 

9.  Electronic  line  of  sight  bibnaps  available  for  all  locations 

Compute  complete  hearability  graph 


10. 


Location  of  some  red  transceivers  suspected 
Find  possible  blue  assignments 


11.  Network  topology  of  two  or  more  red  transceivers  suspected 

Partial  one-on-one  blue/red  matchings 

12.  Location  of  all  red  transceivers  suspected 

13.  Network  topology  of  all  red  transceivers  suspected 

One-on-one  parity  matching  of  blue  on  red 
Unstable  matching  (weak  S/N  ratios  permitted) 

Stable  matching  (optimal  S/N  ratio  matching  plotted) 


*****•*••**•••••••••**•*•*»***»••••••••*•••*••**•*****»•*»• 

SEVERELY  CONSTRAINED  OPTIMIZATION  O  [  k  ] 
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Massive  parallelism:  the  proposed  tByte-LoSight  machine. 


When  devdoping  an  ensemble  of  field  of  view  bitmaps  corre^nding  to  specific  sites  within  a 
gridded  database,  it  is  possible  to  gain  algorithmic  performance  by  utilizing  concurrency  wherever 
possible.  The  proposed  tbyte-LoSight  machine  is  a  massively  parallel  architecture  concept  designed  to 
perform  field  of  view  bitmap  constructions  and  cooperative  transceiver  placenwnt.  The  machine  as 
envisioned  is  dimensioned  as  a  1000  x  1000  array  of  processors,  with  each  processor  having  access  to  one 
megabyte  of  local  RAM.  The  million  processors  have  a  combined  access  to  a  million  bytes  of  local  RAM 
-  hence  the  tByte  notation. 

Fidd  of  xnew  calculations.  A  processor's  location  in  the  tByte-LoSight  array  is  analogous  to 
the  corresponding  location  in  Digital  Terrain  Elevation  Data,  Digital  Elevation  Matrix,  or  similar 
gridded  array.  Each  processors  RAM  is  preloaded  with  the  values  from  the  gridded  database.  Then,  in 
parallel,  the  field  of  view  bitmap  Xi,j  for  each  location  (i,j)  is  produced  for  each  element  of  the  array, 
and  is  used  to  overwrite  the  grid  data  in  local  RAM.  With  this  architecture,  tByte-LoSight  is  capable 
of  storing  up  to  a  million  bitmaps,  each  one  megab)'te  in  size.  Using  an  approxinutte  field  of  view 
algorithm,  to  produce  an  ensemble  of  n^  bitmaps,  the  performance  of  the  architecture  is  O  [  n^  ].  An 
alternative,  more  finely-grained  architecture  replaces  each  processor  with  an  n  x  n  array  of  processors, 
which  speeds  up  the  bitmap  generation  process  to  O  [  n  ].  The  performance  improvement  is  due  to  each 
low  level  processor  computing  a  single  LOS  radius  rather  than  n^  radii. 

Transceiver  placement.  When  during  asset  management  processing  a  transceiver  is  tentatively 
placed  at  cell  (i,j),  the  tByte-LoSight  machine  fetches  bitmap  Xi,j  and  inverts  it  to  produce  the 
complement  of  Xi,j ,  denot^  ~Xi,j .  Concurrently,  tByte-LoSight  then  computes  the  set  differmce  between 
each  of  its  elements  and  Xij ,  followed  by  the  set  difference  between  and  the  result,  while 
progressively  updating  local  RAM.  Finally,  a  logical  operation  is  performed  to  find  the  processor 
whose  RAM  most  closely  resembles  the  null  bitmap,  for  this  location  is  deemed  to  be  the  best  site  for  a 
second  transceiver.  From  the  second  site's  location,  this  logic  may  be  iterated  until  the  summed 
visibility  ratio  approaches  one,  until  the  supply  of  transceivers  is  exhausted,  or  until  it  is  judged  that 
no  further  improvement  is  possible.  The  tByte-LoSight  architecture  is  currently  just  a  concept.  The  cost 
and  reliability  of  such  a  massive  architecture  are  issues,  as  are  data  transfer  rates,  power  requirements, 
and  communications  polling  costs. 


Conclusions. 

It  has  been  shown  that  the  Intelligence  Elecfronic  Warfare  transceiver  allocation  problem 
exhibits  a  wide  spectrum  of  algorithmic  complexity,  depending  upon  the  availability  of  constraints  to 
bound  the  problem.  At  one  extreme,  when  no  constraints  are  available,  the  time  complexity  required  for 
an  optimal  field  of  view  solution  is  exponential  in  the  number  of  transceivers  allocated.  C)n  the  other 
hand,  for  the  tightly  constrained  problem,  when  the  radio  frequency  propagation  graph  is  known,  along 
with  suspected  locations  of  the  transmitters  of  interest,  the  time  complexity  is  linear  in  the  number  of 
allocated  transceivers,  since  the  problem  reduces  to  Knuth's  stable  marriage  theorem.  It  is  speculated, 
but  has  not  yet  been  proven,  that  when  only  weaker  constraints  are  available,  the  complexity  falls 
somewhere  in  between.  Another  result  to  emerge  from  this  research  is  a  new  technique  to  place 
transceivers  on  the  fringes  of  areas  invisible  to  an  already  placed  transceiver.  Finally,  a  massively 
parallel  architecture  has  been  conceptualized  to  support  both  field  of  view  and  transceiver  placement, 
although  infeasible  power  requirements  preclude  implementation  in  the  short  term. 
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ABSTRACT 

The  Concepts  Evaluation  Model  Vn  (CEM  VII)  is  the  latest  version  in  the  progres¬ 
sion  of  US  Army  theater  force  model  development  It  has  been  developed  under  the  auspices  of  the 
US  Army  Concepts  Analysis  Agency  (CAA).The  model  is  used  by  the  Army  to  assess  and  opti¬ 
mize  combat  force  capability.  Typically,  the  model  involves  battle  simulations  of  many  months 
duration,  encompassing  multiple  armies  and  diverse  terrain. 

CEMPLOT  is  a  visualization  tool  used  to  portray  the  results  of  batde  simulations 
performed  in  CEM  Vn.  It  fulfils  the  need  for  a  user-friendly,  interactive  graphics  tool  with  different 
display  options.  The  interface  is  developed  to  run  in  the  X  Wmdows  and  OpenWindows  environ¬ 
ments.  The  program  is  written  in  C  and  it  utilizes  Xlib  and  XGL  libraries.  The  user  selects  various 
frames  of  information  from  a  menu,  developed  using  the  X>^ew  toolkit 

Some  of  the  options  offered  to  the  user  are:  1)  di^lay  of  die  battle  terrain,  2)  display 
of  the  location  of  the  Forward  Edge  of  the  Battle  Area  (FEBA)  of  the  two  conflicting  forces  at  dif¬ 
ferent  time  steps,  3)  display  of  resources  such  as  number  of  artillery  pieces  and  close  air  support 
units  for  the  assembled  army  units,  4)  display  of  the  names  of  army  units  in  combat  and  5)  infor¬ 
mation  regarding  the  personnel  and  ammunition  losses  incurred  by  the  two  opposing  forces  at  dif¬ 
ferent  time  intervals. 
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1.  INTRODUCTION 


1.1  Historical  Perspective 

The  CoiKepts  Evaluation  Model  Vn  (CEM  VII)  is  the  discrete  event  battle  simulation 
model,  which  exists  under  the  auspices  of  the  US  Anny  Concepts  Analysis  Agency  (CAA),  located 
in  Bethesda,  Maryland.  CEM  Vn  is  used  by  dw  Army  to  assess  and  optimize  combat  force  cipa- 
bility.  The  kernel  of  CEM  Vn  capability  is  the  ATCAL  (Attrition  Model  Using  Calibrated  Param¬ 
eters)  algoridun  [Johnsnid,  1980]  which  is  used  to  perform  engagements.  CEMPLOT  is  the 
visualization  tool  developed  at  the  Department  of  Mechanical  Engineering,  Colorado  Stale  Uni- 
versiQr,  to  di^lay  and  interpret  information  produced  by  CEM  VIL  Typically,  CEM  Vn  involves 
simulations  of  many  months  duration,  encompassing  multiple  armies  and  diverse  terraiiL  The  util¬ 
ity  of  the  simulation  lies  in  the  degree  to  which  die  informadon  produced  by  one  or  more  runs  of 
the  model  can  be  interpreted.  A  PC  plotting  program  [Stoll,  1992]  exists,  which  visualizes  CEM 
vn  ott^ut  data.  CEMPLOT,  besides  duplicating  die  dinctionality  of  the  PC  program  in  a  Sun  X 
environment,  presents  the  user  with  additional  features  to  help  visualize  CEM  ouqiut 

1.2  CEMPLOT  Program  Overview 

CEMPLOT  runs  in  the  X  T^dows  and  OpenWindows  environments  and  uses  die  XGL 
graphics  library  extensively  (XGL  2.0).  It  uses  a  gn^ihics  toolkit  X\KVf  to  provide  user-interface 
objects  such  as  menu  buttons.  The  program  has  the  capability  to  di^lay  CEM  output  data  in  color 
and  accepts  informadon  and  commands  from  the  user,  interacdvely.  The  user-interface  consists  of 
a  base  frame  or  window  which  contains  die  graphics  canvas  (e.g.,  where  the  terrain  and  FEBA  are 
plotted)  and  the  input  panel  (control  buttons).  The  button  items  include  panel  buttons  which  per¬ 
form  a  funcdon  when  clicked  on,  and  menu  buttons  which  allow  the  user  to  select  a  pardcular 
opdon  from  a  menu. 

The  position  of  CEMPLOT  in  die  CEM  hierarchy  is  di^layed  in  the  flow-chart  of  Hgure 
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1,  showing  the  data  files  which  fcMin  the  input  to  the  visualizatuHi  tool  The  (nogram  exists  as  a 
collection  'tf  C  routines  which  implement  the  various  qptions  of  CEMnXJT.  The  (xogram  has  die 
ca(>ability  to: 

(i)  Display  the  battle  terrain. 

(ii)  Display  the  Forward  Edge  of  Battle  Area  (FEBA),  overlaid  on  the  terrain,  for  every 
theater  cycle. 

(iii)  Display  tactical  report  data,  such  as  number  of  personnel,  artillery,  close  air  sup{)ort 
units,  etc.,  for  each  unit  assembled  along  the  FEBA  over  different  theater  cycles. 

^  Plot  theater  cycle  number  versus  logistics  data,  such  as  amount  of  ammunition  avail¬ 
able,  versus  time  step. 

(v)  Present  a  roll-call  of  all  units  assembled  on  the  opposing  sides  at  a  particular  theater 
cycle  (time  step). 

(vi)  Perform  geometrical  transformations  (scaling  and  rotation)  on  the  FEBA/terrain  plot 

(vii)  Animate  the  sequence  of  FEBA  movement 

2.  CEMPLOT:  INPUT  FILES 

CEMPLOT  uses  many  input  files,  one  of  which  is  mandatory.  The  mandatory  input  file  to 
the  visualization  program  is  a  battle  terrain  data  file.  This  file  is  one  of  the  input  files  for  the  pre¬ 
processor  which  is  input  to  CEM  Vn  (refer  Figure  1).  The  other  input  files  include  FEBA,  tactical 
report,  and  logistics  data  files,  which  are  ouqmt  from  the  CEM  Vn  post-processor  [Allison  et  al., 
1985].  The  following  sections  discuss  the  data  contained  in  these  files  in  greater  detaiL 
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Terrain  data  file 


Figure  1  Position  of  CEMPLOT  in  the  CEM  hierarchy 


2.1  Battle  Terrain  Data 

The  CEM  uses  a  simplified  representation  of  the  battlefield  [Johnson,  1985].  The  distances 
in  the  direction  of  troop  movement  (vertical)  are  in  kilometers  with  a  minimum  distance  of  one- 
tenth  of  a  kilometer.  Distances  along  the  front  (horizontal)  are  measured  in  minisectors  (see  Figure 
2).  The  battle  terrain  is  diverse  in  nature  and  is  categorized  as  types  A,  B,  C  and  D.  The  four  terrain 
types  denote  die  general  nature  of  the  terrain  and  have  some  bearing  on  the  mobility  of  ground 
combat  units. 

(i)  Type  A:  This  terrain  is  mainly  flat  and  is  excellent  tank  country. 


132 


(ii)  IVpe  B:  Rolling  lands  and  is  marpnally  suited  for  tanks  and  wheeled  vehicles. 

(iii)  IVpe  C:  Moiuitainous  territofy.  On  this  terrain,  tanks  and  wheeled  vehictes  must 
remain  on  the  roads  because  of  steep  slopes. 

(iv)  IVpe  D:  It  represents  some  major  obstacle  diat  would  normally  require  extra  or  spe-  ’ 
cial  effort  for  the  forces  to  negotiate  or  pass  through.  It  may  be  a  river,  lake,  marsh, 
canyon  or  some  man-made  barrier,  such  as  a  minefield. 

The  terrain  data  file  contains  information  regarding  number  of  engagements,  number  of 
minisectors  and  the  terrain  extents  for  sets  of  minisectors.  The  depth  of  a  barrier  of  D  terrain  is  set 
to  the  minimum  distance  of  movement  of  one-tenth  of  a  kilometer.  For  example,  a  particolar  terrain 
band,  say,  10  minisectors  wide,  may  be  comprised  of  3.7  km  of  Type  B,  followed  by  9  km  of  Type 
A,  followed  by  0. 1  km  of  Type  D,  etc. 


Subsector  minisector 
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^  minisectors 


Figure  3  Terrain/FEBA  Map  From  CEMPLOT 

2^  Unit  Deployment  Data 

The  deployed  forces  are  resolved  to  brigatfe  for  die  Blue  side  and  division  for  Red.  Figures 
2  and  3  illustrate  the  deployment  of  a  Blue  force  opposing  a  Red  force.  A  Blue  brigade  front  (such 
as  tte  distance  AB  in  Figure  2)  or  a  Red  division  front  (e.g.  EF)  is  defined  as  a  sector.  The  forward 
line  of  assembled  troops  (Blue  and  Red)  is  termed  the  FEBA  (Forward  Edge  of  Batde  Area).  The 
engagements  between  the  two  forces  results  in  the  forward  and  backward  movement  of  the  FEBA, 
with  time.  The  data  regarding  the  position  of  each  minisector  along  the  FEBA,  for  every  theater 
cycle  is  contained  in  a  data  file,  which  is  output  from  the  CEM  Vn  post-processor.  CEMPLOT 
reads  and  displays  information  from  the  FEBA  data  file. 

23  Tictical  Report  Data 

The  tactical  report  contains  information,  for  every  n*^  division  cycle,  regarding  die  loca- 


tion,  mission,  status,  authorized  troop  strength,  supplies  and  number  of  major  weiqwns  assigned  to 
the  units  in  combat  The  units,  for  which  the  information  is  provided,  include  Blue  brigades  and 
Red  divisions.  The  unit  tactical  report  generated  by  the  CEM  post-processor  is  passed  through  a 
&le  conversion  program  to  maintain  file  format  compatibility  with  dw  existing  PC  CEM  plotting 
package.  The  converted  file  is  then  passed  as  input  to  CEMPLOT. 

2.4  Logistics  Data 

The  logistics  report  presents  data  concerning  the  consumption  and  replacement  of 
resources  by  the  combat  units.  It  includes  the  total  amount  of  resource  lost,  both  temporarily  and 
permanently,  due  to  both  combat  and  noncombat  causes  since  the  start  of  engagements.  The 
resources,  include  the  number  of  participating  persoimel  and  the  amount  of  ammunition  expended. 
CEMPLOT  reads  the  logistics  data  and  displays  it  in  the  form  of  Cartesian  plots. 


3.  CEMPLOT:  PROGRAM  OPERATION 


The  program,  CEMPLOT,  is  run  in  the  UNIX  operating  system,  either  in  the  X  \Wndows 
or  OpenWindows  environment  The  program,  to  start  with,  brings  up  an  application  window  or 
base  frame  as  depicted  in  Figure  5.  The  base  firame  has  an  input  panel  which  consists  of  command, 
text  and  menu  buttons,  examples  of  which  are  shown  below  (Figure  4). 


c 


Terrain  data  file 


) 


name.dat 


(a)  Panel  command  button  (b)  Panel  text  button 


SCALE.AXIS:  All  axes 

(c)  Menu  button 

Figure  4  Examples  of  Panel  Buttons 
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Input  Panel 


Figure  5  CEMPLOT  base  frame 

The  user  types  the  names  of  the  CEMPLX)T  input  data  files  in  the  text  space  provided  to  the 
right  of  the  corresponding  panel  text  buttons.  This  action  is  followed  by  clicking  the  particular  text 
button  using  the  left  mouse  button  [Shivaswamy  and  Bums,  1993].  The  terrain  and  FEB  A  m^s 
are  brought  up  in  the  graphics  area  or  the  canvas  of  the  base  frame. 

The  four  different  terrain  types  are  represented  in  CEMPLOT  by  different  colors.  The  ter¬ 
rain  is  displayed  laterally  as  bands,  each  of  which  is  a  pre-specified  number  of  minisectors  wide. 
The  forward  line  of  the  assembled  troops  or  the  FEBA  is  displayed  as  a  combination  of  a  blue  and 
a  red  band,  each  of  fixed  height  (refer  to  Hgures  3  and  5).  The  individual  units  or  sectors  on  each 
side  fall  on  minisector  boundaries  and  are  delimited  in  CEMPLOT  using  vertical  yellow  lines 
along  the  FEBA. 
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4.  CEMPLOT:  USER  OPTIONS 


These  commands  are  activated  by  clicking  the  left  mouse  button  over  the  required  option 
button.  The  options  available  under  CEMPLOT  are  discussed  below. 

4.1  Print  Unit  Data 

“Unit  Data”  is  the  default  option  under  the  CEMPLOT  main  menu.  This  option  prints  the 
data  regarding  number  of  personnel,  .supplies  and  major  weapons  for  the  “picked”  combat  unit  The 
option  brings  up  the  terrain  map  with  the  FEB  A  overlaid  on  it  The  user  “picks”  a  unit  by  driving 
the  cursor  onto  that  particular  unit  (between  two  adjacent  yellow  lines)  and  clicking  on  it  using  the 
left  mouse  button.  The  unit  thus  picked  is  highlighted  with  an  asterisk.  The  tactical  report  data  per¬ 
taining  to  that  unit  is  displayed  in  an  ouqiut  window  (see  Hgure  6)  which  may  be  moved  around 
or  iconified  just  as  any  other  window.  This  option  retains  its  capability  even  after  the  terrain  and 
FEBA  plot  is  scaled  and  translated. 


Figure  6  CEMPLOT:  Unit  Data  Display 
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42  Print  RoUCaU 


Under  this  option,  the  names  of  all  the  units  present  at  the  end  of  the  current  theater  cycle 
are  displayed.  The  data  are  output,  in  a  manner  similar  to  the  previous  option,  in  a  new  window,  as 
shown  in  Figure  7.  The  data  displayed  include  the  theater  cycle  number  (time  step),  the  names  of 
all  Blue  brigades  and  Red  divisions  in  combat.  The  output  window  can  be  moved  around  or  icon- 
ified  as  in  the  previous  option.  The  terrain  map  with  the  FEB  A  information  overlaid  on  it  will 
remain  displayed  in  the  main  graphics  window  while  running  this  option. 


Figure  7  CEMPLOT:  “RoU  Call”  Output  Window 


43  Show  Logistics  vs.  Theater  Cycle  Graph 

The  total  unit  losses  of  personnel  and  ammunition,  at  each  theater  cycle,  are  plotted  versus 
the  theater  cycle  number  in  Cartesian  form.  This  includes  data  for  each  Blue  brigade  and  Red  divi¬ 
sion.  The  screen  ou^ut  of  this  option  of  CEMPLOT  is  shown  in  Figure  8. 


4.4  Scaling  and  TVanslation 

These  options  allow  the  user  to  scale  or  translate  the  plot  on  the  screen  in  X,  Y,  Z,  or  uni¬ 
formly  in  all  the  three  coordinate  directions.  The  user  selects  the  axis  of  transformation  from  a 
menu  and  the  scaling  or  translation  value  firom  a  slider  bar.  The  scaling  and  translation  values  are 
a  percentage  of  the  screen  plot  and  are  cumulative  in  nature.  Thus,  scaling  first  by  200%  and  then 
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Figure  8  CEMPLOT:  logistics  vs.  cycle  number  graph 

by  50%  results  in  a  cumulative  scaling  of  100%  (unchanged).  These  options  are  particularly  useful 
when  many  combat  units  are  assembled  across  a  few  minisectors,  in  which  case  the  plot  could  be 
scaled  to  yield  mote  detailed  information. 

4.5  *Start%  *Step  +’  and  *Step  Options 

The  battle  engagements  between  the  opposing  forces  take  place  over  a  specified  number  of 
time  steps  or  theater  cycles.  The  above  options  allow  the  user  to  view  data  present  in  the  FEB  A  and 
tactical  reports  for  any  theater  cycle.  The  ‘Start’  button  sets  the  theater  cycle  number  to  zero, 
enabling  the  user  to  view  information  pertaining  to  start  of  battle  engagements.  The  options  ‘Step 
+’  and  ‘Step  increment  and  decrement  the  FEBA  plot  by  one  theater  cycle  number,  correspond¬ 
ingly. 
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4.6  Animation:  ‘Animate  +’  and  ‘Animate 

These  options  animate  the  sequence  of  FEB  A  movement  for  theater  cycles  zero  to  maxi¬ 
mum.  ‘Animate  +’  and  ‘Animate  move  the  FEBA  from  zero  to  maximum,  and  from  maximum 
to  zero,  respectively.  These  options  are  helpful  as  they  provide  the  user  with  an  overview  of  the 
progression  of  the  entire  battle. 

5.  CEMPLOT:  Examples  of  Use 

5.1  Interpreting  Terrain  Features  and  FEBA  Movement 

The  terrain  and  FEBA  map  displayed  by  CEMPLOT  can  be  studied  for  the  distribution  and 
extents  of  each  terrain  type  and  their  influence  on  the  movement  of  forces.  This  can  be  done  very 
easily  by  stepping  through  the  different  theater  cycles  and  studying  the  displacement  of  the  FEBA. 
This  study  can  be  repeated  for  different  terrain  data  sets  and  their  corresponding  FEBA  location 
files.  Figures  9,  10,  and  11,  display  terrain  and  FEBA  data  sets  for  the  Campaign  I,  Europe,  and 
Ardennes  data  sets,  respectively.  Each  figure  displays  the  position  of  the  FEBA  at  the  beginning 
and  end  of  the  simulation  (in  other  words,  at  the  end  of  theater  cycle  number  zero  and  maximum, 
respectively). 

5.2  Parametric  Studies 

CEMPLOT  can  be  used  to  visualize  any  CEM  Vn  simulation  model  across  force  varia¬ 
tions.  It  provides  a  user-friendly  and  robust  tool  for  the  visualization  of  data  generated  by  different 
runs  of  CEM  vn.  CEMPLOT  can  display  several  measures  whereby  one  force  variation  can  be 
compared  to  another.  These  include  the  displacement  of  the  FEBA,  the  total  resourxxs  expended 
by  each  side,  and  the  time  taken  to  arrive  at  a  particular  state  or  condition  of  the  battle.  The  user 
can  achieve  this  very  easily  by  running  multiple  copies  of  CEMPLOT  (invocation  in  the  back- 
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ground  mode),  providing  each  with  data  files  output  from  varying  il^>ut  parameters.  The  separate 
CEMPLOT  windows  thus  generated  can  be  scaled  and  set  adjacent  to  one  another  for  convenience 
in  comparison. 


(a)  Theater  cycle  number  0  (b)  Theater  cycle  number  15 

Figure  9  Camimign  I  data  set 


5,3  Evaluation  of  CEM  VII  Algorithms 

CEM  Vn,  including  its  pre-  and  post-processors  occupy  nearly  200  MegaBytes  of  disk 
space.  The  post-processor  outputs  up  to  18  different  data  files,  some  of  which  are  in  binary  format 
The  data  files  are  very  large  in  size  and  are  daunting  to  interpret  textually.  The  massive  amounts  of 
data  generated  mandate  that  a  special  means  of  display  be  developed  in  order  to  interpret  and  dis¬ 
play  them.  CEMPLOT  seeks  to  fulfill  this  need,  in  a  user-friendly  fashion,  in  real  time,  in  a  UNIX 
workstation  enviromnent 

Efforts  at  Colorado  State  University  [Brewer  and  Burns,  1991]  have  been  directed  towards 
the  vectorization  of  the  ATC AL  algorithm  and  its  insertion  into  CEM  Vn,  CEMPLOT  has  proved 
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(a)  Theater  cycle  number  0  (b)  Theater  cycle  number  15 

Figure  10  Europe  data  set 

MatatmltiBiap'K  tnst  to  Ml  •••  •  B 


IvnlaC^U  TmUI:l 


I  !iaBiaia*a««iii»a 

■ 

TirriMlJHi  IvraittJilHl 


(a)  Theater  cycle  number  0  (b)  Theater  cycle  number  15 

Figure  11  Ardennes  data  set 


to  be  very  useful  in  interpreting  the  results  of  vectorization  and  in  comparing  them  to  those 
obtained  from  unvectoiized  algorithms.  The  evaluation  is  done  by  exploring  die  convergence  of 
the  unvectorized  and  vectorized  codes.  The  global  variation  of  FEB  A  with  time  is  adopted  as  a  met¬ 
ric  of  convergence,  which  is  dien  studied  from  the  display  generated  by  CEMPLOT. 

6.  Future  Work 

6.1  Incorporating  Posture  Information 

Each  combat  unit,  r ^ :  jived  to  Blue  brig^e  and  Red  division,  is  associated  with  a  particular 
posture  for  the  engagement  [Allison  et  al.,  1985].  This  may  be  one  of  the  following  eight  types:  (1) 
Blue  attack;  Red  delay,  (2)  Blue  attack;  Red  prepared  defense,  (3)  Blue  attack;  Red  hasty  defense, 
(4)  Blue  attack;  Red  attack,  (5)  Red  attack;  Blue  hasty  defense,  (6)  Red  attack;  Blue  prepared 
defense,  (7)  Red  attack;  Blue  delay,  and  (8)  Static.  Efforts  are  presently  underway  to  display  this 
information  for  all  the  assembled  units  at  the  end  of  each  theater  cycle. 

6.2  3-Dimensional  Fractal  Display  of  Terrain 

A  realistic  interpretation  of  the  battle  terrain  data  has  been  attempted  making  use  of  the 
technique  of  fractals.  This  effort  involves  modeling  the  terrain  data  using  a  firactal  generating  algo¬ 
rithm  in  a  3-dimensional  format  The  generation  and  display  of  such  a  model  has  been  achieved 
and  efforts  are  underway  to  incorporate  it  as  an  option  under  CEMPLOT. 
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THEORY  OF  A  BIFURCATING  MODEL  NEURON: 
A  NONLINEAR  DYNAMIC  SYSTEMS  APPROACH* 


N.H.  Farhat,  M.  Ektelhiwy  and  S‘Y.  Un 
University  Pennsylvania 
Electrical  Engineering  Department 
PhUadelphia.  PA  19104 


ABSTRACT.  A  nonlinear  dynamic  systems  approach  to  mathematical  analysis  of  the 
integrate-and-fiie  (I  &  F)  model  neuron,  a  monoionic  simplification  of  the  Hodgkin-Huxley  model 
for  action  potential  generation  in  the  excitable  biological  membrane,  is  present^.  It  is  shown  that 
under  periodic  activation  (driving  signal),  as  would  arise  in  a  synchronized  network  of  such 
neurons  when  dendritic-tree  processing  is  assumed,  the  firing  behavior  is  described  by  an  iterative 

map  of  the  interval  [0, 27t]  onto  itself  which  we  call  phase-transition  map  (PTM).  Like  other  maps 
of  the  interval  cmto  itself,  the  PTM  can  be  studied  employing  the  tools  of  nonlinear  dynamics.  This 
furnishes  a  novel  way  of  viewing  the  microneurodynamics  of  neural  networks  and  shows,  that 
despite  the  simplifications  made  in  the  I  &  F  model  neuron,  it  exhibits,  in  its  spiking  behavior,  a 
high  degree  of  functional  complexity  approaching  that  of  the  living  neuron.  This  is  manifested  by 
a  variety  of  firing  modalities  depending  on  parameters  of  the  periodic  activation,  which  include 
regular  firing  phase-locked  to  the  periodic  activation  or  a  subharmonic  of  it,  quasi-periodic  firing, 
erratic  firing,  and  bursting  and  can  bifurcate  (rapidly  switch)  between  these  firing  modalities  as  the 
parameters  of  the  periodic  activation  are  altered,  hence  the  name  bifurcating  neuron.  Illustrative 
examples  of  this  complex  behavior  are  given  in  the  form  of  bifurcation  diagram,  Arnold  Tongues 
diagram  and  the  DeviTs  Staircase  diagram.  These  show  the  bifurcating  neuron  is  able  to  detect 
coherent  episodes  in  its  incident  spike  wavefront,  the  aggregate  of  spike  trains  incident  on  its 
synaptic  inputs,  and  encodes  such  coherent  activity  whenever  it  occurs,  in  a  complex  manner 
depending  on  ^e  parameters  of  the  periodic  activation  potential  produced  by  dendritic-tree 
processing  of  the  incident  spike  wavefront.  When  the  activation  potential  is  not  periodic,  the 
bifurcating  neuron  reverts  to  the  usual  sigmoidal  response  and  shows  an  upper  limit  on  firing 
frequency  which  serves  the  useful  function  of  containing  the  maximum  firing  activity  in  a  network 
of  such  neurons  in  a  manner  analogous,  but  not  exactly  equivalent,  to  a  similar  limit  imposed  by 
refractoriness  in  the  living  neuron. 

The  bifurcating  neuron  model  combines  functional  complexity  with  structural  simplicity 
and  low  power  consumption  (because  of  its  spiking  nature).  Accordingly,  it  is  ideal  for  use  in 
modeling,  simulation,  or  hardware  implementation  of  a  new  generation  of  neurocomputers  in 
which  synchronicity,  bifurcation,  and  chaos,  which  are  believed  to  underlie  higher-level  cortical 
functions,  can  be  studied. 

INTRODUCTION.  Most  neural  net  models  assume  sigmoidal  neurons  and  therefore 
cannot  account  for  the  relative  timing  of  action  potentials  (spike  patterns)  impinging  on  the 
dendritic-tree  of  a  neuron  from  its  presynaptic  neurons.  The  bifurcating  neuron  concept  ^d  model 
[1]  is  an  outcome  of  using  the  tools  of  nonlinear  dynamics  to  characterize  the  behavior  of  the 
neuron's  excitable  (axonal)  membrane  in  a  manner  that  accounts  naturally  for  temporal  effects.  It 
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combines  functkmal  complexity  paralleling  that  of  the  living  neuron  with  shr  .  airal  simplicity  and 
low  power  consumption  which  n^es  it  an  attractive  building-block  for  a  new  generation  of  neural 
networks  specially  suited  for  modeling  higher-level  cortical  functions  such  as  feature-bin^g, 
cognidon,  infeiencing,  attention,  reasmung,  etc. 


There  is  increasing  evidence  in  the  literature  encouraging  speculation  that  such  higher-level 
functions  may  involve  synchronicity,  phase-locking,  bifurcations,  erratic  (chaotic  firing  for 
possible  adaptive  annealing),  and  dynamic  sign^  dependent  (adaptive)  partitioning  through  which 
populations  of  cratical  neurons  may  fleetingly  divide  into  sub-populations  or  assemblies,  each  with 
a  prescribed  relative-phase  pattern  between  its  spiking  neurons,  that  act  in  parallel  in  solving 
complex  tasks  with  each  sub-population  acting  collectively.  Thus  dynamic  adaptive  partitioning  is 
the  mews  by  which  cortical  networks  may  carry  out  collective  computations  in  several  sub¬ 
populations  of  neurons  in  parallel.  The  significance  of  collective/parallel  processing  was  recently 
discussed  by  Zak  [2]  in  the  context  of  Nonlipschitzian  neural  networks  with  unpredictable 
dynamics. 

The  relative  timing  and  synchronicity  in  firing  of  neurons  as  a  mechanism  for  feature- 
binding  was  predicted  by  Marlsburg  [3]  and  Abeles  [4]  on  theoretical  grounds.  Recently, 
synchronization  effects  in  the  firing  of  neurons  at  different  recording  locations  in  the  cat's  visuk 
cortex  upon  suitable  visual  stimulus  have  been  observed  [5]-[ll],  raising  thereby  intriguing 
speculation  about  the  nature  of  cortical  processing,  and  stimulating  wi^  spre^  interest.  Temporal 
effects  in  the  olfactory  bulb  have  also  been  studied  extensively  by  Freeman  [12]  and  claims 
regarding  the  implications  of  rythmic  firing  activity  in  consciousness  have  been  covered  by  the 
media  [13].  Further  discussion  of  synchronization  effects  in  tempooal  neural  netwoiics  is  given  in 
[14]-[16].  It  is  now  conjectu^  that  synchronized  firing  of  neurons  in  the  visual  cortex,  for 
example,  might :  (a)  label  spatially  coherent  or  common  features  in  the  visual  data  such  as  nnotion, 
conmst,  texti^,  or  color,  (b)  play  a  role  in  feature-binding,  cognition,  and  other  higher-level 
cortical  functions,  (c)  could  play  a  role  in  information  absorption  (learning)  optimization  in 
biological  systems.  Obviously,  judging  the  validity  of  this  conjecture  requires  thorough 
understwding  of  the  mechanise  that  can  cause  neurons  at  separate  locations  to  synchronize  their 
firings  including,  computer  simulations  aimed  at  studying  and  understanding  the  collective 
dynamics  of  spiking  neural  networks  whose  neurons  are  not  simple  processing  element  but 
possess  functional  complexity,  like  that  of  the  bifurcating  neuron  model,  that  would  reflect  itself 
ultimately  in  the  computing  power  of  the  network  as  a  whole. 

In  this  paper  we  present  the  results  of  a  formulation  of  a  bifurcating  neuron  theory  which 
enables  applying  the  power  and  the  tools  of  nonlinear  dynamics  to  characterizing  its  behavior  in  a 
noanner  that  permits  gi^ter  insight  in  neurodynamics  and  to  possible  utilization  of  synchronization, 
bifurcation  and  chaos  in  the  design  of  a  new  generation  of  powerful  neuroconqiut^. 

The  goal  is  to  develop  a  bifurcating  neuron  theory;  that  is  descriptive,  predictive  and 
qualitatlYC-  Descriptive  in  the  sense  that  it  should  give  true  physical  insight  into  the  complicated 
processes  involved.  In  the  words  of  P.M.  Koch  [17],  "a  useful  theory  describes  the  essential 
physics  of  a  process  simply,  preferably  with  figures  and  simple  equations  whose  behavior  with 
variatirai  of  parameters  can  be  explored.  The  theory  must  be  predictive  because  unless  it  can  raise 
new  questions  a^  predict  answers,  it  will  likely  be  a  dead  end".  Experimenters  warm-up  to 
theories  that  say  "if  you  do  such-and-such,  then  you  will  observe  thus-and-so".  TTie  theory  needs 
to  be  also  quantitative  because  something  is  missing  if  it  cannot  be  reduced  to  calculations  that  "get 
the  numbers  right". 

All  these  gods  are  achieved  in  the  theory  presented  here.  To  them  we  need  to  add,  from 
the  outset,  the  essential  features  we  seek  in  a  neuron  noodel  evolving  from  any  developed  theory. 
These  features  are: 


146 


Production  of  an  action  potential  or  spike 


•  Existence  of  a  refractory  period  to  limit  the  maximum  firing  frequency  to  conserve 
energy  and  eliminate  reverteradons  and  reflections  in  a  netwcHk 

•  Ability  to  integrate  incoming  synaptic  inputs  and  to  account  for  passive  and/or  active 
spado-temporal  processing  carried  out  by  the  neuron's  dendridc-tree 

•  Ability  to  respond  to  external  signals  in  a  manner  that  resembles  that  of  the  living 
(biolo^cal)  neuron,  i.e.,  pioducdon  of  spiking  acdvity  similar  to  that  observed  in  the 
biological  neuron  under  efferent  forms  of  stimulus. 

•  Be  structurally  simple  and  possess  low  power  consumption  in  order  to  facilitate 
hardware  implementation  of  neurocomputering  structures. 

The  starting  point  is  the  Hodgkm  and  Huxley  model  of  the  biological  nnembrane  [IS]  which 
describes  membrane  dynamics  and  spiking,  the  production  of  action  potentials,  in  terms  of  three 
ionic  currents.  The  shape  of  the  action  potential  (spike)  produced  by  this  model  resembles 
faithfully  that  observed  in  the  living  (biological)  neuron  and  exhibits  refractoriness.  The  distinct 
shape  of  the  action  potential  is  determined  by  the  interplay  between  the  dynamics  of  the  three  ionic 
currents  in  the  model.  Because  the  shape  of  all  action  potentials  in  biological  networks  is  more  or 
less  the  same,  one  can  argue  that  the  shape  does  not  convey  information  and  that  only  the 
interspike  interval  (the  interval  between  action  potentials)  and/or  the  relative  timing  between  action 
potentials  in  a  network)  are  important  in  neuroprocessing  of  information.  This  argument  has 
prompted  us  to  simplify  the  H-H  model  to  a  monoionic  current  model  in  order  to  facilitate  its 
analysis  as  a  dynamical  system.  This  is  done  in  Section  2  for  two  types  of  activation:  nonperiodic, 
which  is  shown  to  lead  to  the  usual  sigmoidal  response  and  perit^ic,  which  is  assumed  to  arise 
when  the  dendritic-tree  receives  correlated  spike  trains,  i.e.,  coherent  spike  wavefronts  and  is 
shown  to  lead  to  complex  functional  modalities  far  excee^g  that  of  the  sigmoidal  neuron  model. 
In  Section  2  we  also  present  examples  of  applying  the  bifurcating  neuron  theory,  specifically  the 
PTM,  to  characterizing  the  behavior  of  two  bifurcating  neuron  embodiments  that  illustrate  the 
complex  behavior  of  the  model.  Concluding  remarks  and  implications  of  such  functional 
complexity  and  other  properties  of  the  bifiucating  neuron  are  discus^  in  Section  4. 


2.  ANALYSIS.  Figure  1(a)  shows  an  e^valent  circuit  of  the  monoionic  model.  In  it 
I  is  the  monoionic  membrane  currently  representing  an  energy  source  for  restoring  the  membrane 
potential  after  firing,  R  and  C  are  the  membrane  resistance  and  capacitance  respectively,  v  is  the 

capacitor  voltage,  i  =  ({>  (v')  represents  the  membrane  nonlinearity,  assumed  to  be  S-shaped  and 
u(t)  is  the  activation  potential  of  the  membrane.  u(t)  represents  the  effect  produced  at  the  biological 
neurons  hillock  by  synaptic  inputs  to  the  neuron.  A  piece- wise  linear  approximation  of  the  S- 
shaped  nonlinearity  is  shown  in  Fig.  2  after  inclusion  of  the  effect  of  the  activation  potential  u(t) 
which  represents  the  modulation  in  membrane  potential  produced  by  integration  of  action  potenti^s 
(input  spike  trains)  arriving  at  the  neuron's  dendritic-tree  from  its  presynaptic  neurons,  llie  circuit 
in  Fig.  1(b)  is  equivalent  to  that  in  Fig.  1(a)  when  the  voltage  source  E  is  set  equal  to  E  =  IR.  It 
represents  the  circuit  diagram  of  an  integrate-and-flre  neuron  or  relaxation  oscillator  neuron  whose 
dynamics  we  will  study  here  in  detail.  Despite  its  simplicity,  this  circuit  will  be  shown  to  exhibit 
complex  behavior,  specially  when  u(t)  is  periodic.  In  Fig.  2,  v^jj  and  Vg^t  are  related  respectively 
to  the  breakdown  or  threshold  voltage  vj  and  the  extinction  voltage  V2  of  the  S-shaped  nonlinearity 

<j>(v).  Notice  that  in  the  absence  of  activation  potential  or  (driving  signal)  u(t),  v'  coincides  with  v 
and  vth  reduces  to  vj  while  Vg^t  reduces  to  v2. 
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Fig.  1.  Equivalent  circuit  of  monoionic  model  Hg.  2.  Piece-wise  linear  aj^noximation 

of  the  excitable  membrane.  of  the  S-shaped  nonlinearity  of  the  neurons 

membrane. 

In  the  absence  of  the  driving  signal  u(t)  and  when  E  >  vj,  the  circuit  of  Fig.  1(b)  behaves  as  a 
nonlinear  relaxation  oscillator.  The  S-shaped  i-v  (current-voltage)  characteristic  i  =  shown  in 
Fig.  2  reduces  to  that  shown  in  Fig.  3,  i.e.,  i  =  <l)(v)  curve  with  a  negative  resistance  region,  a 

breakdown  or  threshold  voltage  vj  and  extinction  voltage  V2.  This  i-v  characteristic  i  =  <l>(v)  is 
seen  to  consist  of  three  distinct  regions  I,  II  and  III.  Regions  I  and  III  are  positive  resistance 
regions  corresponding  to  the  resistance  of  nonlinear  element  i  =  <|>(v)  in  the  off  and  on  states 
respectively,  while  region  II  is  a  negative  resistance  region.  The  distance  cff  in  this  plot  is 
exaggerated  in  comparison  to  that  in  actual  i-v  characteristics  in  order  to  delineate  the  negative 
resistance  region  of  the  plot.  The  behavior  of  the  circuit  in  Fig.  1(b)  is  governed  by  the  nonlinear 
differential  equation, 

(1)  +  <t>(v)  =-^ 
dv 

The  steady  state,  (1)  defined  by 

(2)  <l>(v) 

in  which  i  =  (E-v)/R  defines  a  load-line  in  the  i-v  plane  as  shown  in  Fig.  3  and  the  intersection 
point  between  the  load-line  and  <tKv)  defines  the  operating  point  A. 
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When  E  >  V}  and  the  load-line  i(v)  =  (E-v)/R  intersects  the  i-v  characteristics  in  the 
negative  resistance  region  i.e.,  the  operating  point  A  falls  in  region  11  as  shown  in  Fig.  3.  the 
circmt  of  Fig.  1(b)  exhibits  the  limit-cycle  oscilbtions  indicated  by  the  closed  dotted  line  trajectcxy 
abcda.  The  portion  a6^  of  the  trajectory  corresponds  to  the  conduction  of  a  current  spike  through 
the  nonlinear  element  when  the  capacitor  voltage  v(t)  reaches  the  breakdown  value  vj ,  while  £e 
portion  da  corresponds  to  the  interspike  interval  during  which  the  charge  in  capacitor  C  is  being 
restored  and  the  nonlinear  element  is  in  off  state.  Changes  in  the  values  of  E,  v,  and/or  R,  cause 
the  load-line  and  operating  point  A  to  shift.  Shifting  the  location  of  A  within  the  negative 
resistance  regions  alters  the  sp^  of  modon  along  the  limit-cycle  trajectory  aixl  this  altars  the  firing 
frequency.  Limit-cycle  oscillations  cease  to  exist  when  the  operating  point  A  enters  either  of 
regions  I  or  ID.  When  A  is  in  region  HI,  the  nonlinear  element  remains  on  while  for  an  A  falling  in 
region  I,  the  nonlinear  element  remains  off.  The  simple  circuit  dynamics  described  qualitatively 
above  become  considerably  mtne  complicated  in  the  presence  of  a  time  varying  or  periodic  driving 

signal  u(t).  Then  we  need  to  modify  eq.  (1)  by  replacing  <t>(v)  with  <|>(v  -  u(t))  which  means  that  the 

limit-cycle  dynamics  are  complicated  by  time  dependent  displacement  of  the  <|>  curve  in  the 
horizontal  direction  vis-a-vis  the  stationary  load  line.  This  and  Ae  interplay  between  u(t)  and  the 
capacitor  voltage  v(t)  is  the  underlying  cause  of  the  complex  dynamics  of  the  bifurcating  neuron 
circuit  and  the  observation  of  complex  firing  sequences  for  certain  values  of  parameters  of  the 
periodic  driving  signal  u(t)  as  will  be  shown  later. 

The  limit-cycle  trajectory  can  be  readily  displayed  on  a  CRO  by  driving  the  x-axis  of  the 
CRO  with  the  voltage  drop  proportional  to  i(t)  in  Fij^.  3  and  the  Y-axis  with  the  voltage  v  =  v(t) 
appearing  across  the  nonlurear  element  An  example  of  such  a  display  is  given  in  the  photograph 
of  Fig.  4  which  represents  the  limit-cycle  trajectory  in  the  i-v  phase-space  of  the  circuit  of  Fig. 
1(b)  and  is  seen  to  be  similar  to  the  idealized  limit-cycle  trajectory  marked  in  Fig.  3.  The  aj^arent 
uneven  brighmess  of  the  trajectory  in  the  photograph,  reflects  the  uneven  speed  with  which  the 
electron  bem  traces  the  trajectory  on  the  CRO’s  phosphor  screen  as  dictated  by  the  time  variation 
of  i(t)  and  v(t).. 

The  current  waveform  i(t)  corresponding  to  limit-cycle  oscillations  consists  of  a  train  of 
narrow  fixed-shape  impulses,  or  spikes,  of  fixed  peak  amplitude  io  and  duration  T ^  corresptxuling 

to  the  a  ->  b,  b— >  c  — >  d  portions  of  the  trajectory.  The  interspike  interval  T2  corresponds  to  the 

d->  a  interval  of  the  trajectory.  The  spikes  in  i(t)  represent  action  potentials  so  to  speak,  of  the 
bifurcating  neuron.  Expressions  for  T^  and  T2  can  be  readily  deriv^  when  the  nonlinear  element 
in  Fig.  1  is  assumed  to  have  the  idealized  S-shaped  characteristic  of  Fig.  3.  The  results  are  given 
by  [19], 

(3)  T,  =  RC 


(4) 


T2  =  pC  .^n 


where  p  =  RRi/(R  +  Ri),  with  Rj  being  the  nonlinear  element's  resistance  in  segment  HI  of  the  i-v 
characteristics,  and  Vq  is  the  voltage  value  at  the  point  of  intersection  of  the  extension  of  segment 
in  of  the  characteristic  with  the  v  axis  as  shown  in  Fig.  3.  All  other  quantities  in  eqs.  (3)  and  (4) 
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A 


i-<|>(v) 


Fig.  3.  Piece-wise  linear  approximatioa  of  the  i-v  (current-voltage)  characteristic  of  the  glow-lamp  (j^ow-discharge 
tube)  consisting  of  three  segments:  I,  II  and  in  and  showing  the  limit-cycle  trajectory  abed  (dotted  line)  rrfened  to  in 
the  text 


V 


Fig.  4.  Limit  cycle  trajectory  in  the  i-v^.  phase-space  of  the  circuit  of  Fig.  1  assodaied  with  the  naniial  oscillations 
of  the  bifurcating  neuron  (E  s  160  [V],  R  s  100  [KX2],  C  »  .UpF],  s  10  [Q].  Vertical  scale:  10  [mA/div.], 
horizontal  scale:  10  (V/div.]).  A.C.  coupled  CRO  display  is  shown. 
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arc  as  defined  earlier.  Note  that  eqs.  (3)  and  (4)  are  meaningful  only  when  E  >  vj.  When  this 

condition  is  not  satisfied  the  arguments  of  the  in  (....)  term  in  eq.  (3)  becomes  negative  precluding 
thereby  physically  meaningful  solutions  and  disruption  of  limit-cycle  oscillations.  In  accordance  to 
eqs.  (3)  and  (4)  the  interspike-interval  or  period  of  the  bifurcating  neuron’s  oscillation  is  T  =  Tj  + 
T2  and  the  instantaneous  firing  frequency  is  f  1/T  =  1/(T  i  +  T2).  Because,  in  normal  operation, 
Rj  «  R,  we  find  that  the  spike  width  T2  is  much  less  than  the  interspike  interval  T  i  and  therefore 

T  —  T1  becomes  a  good  approximation. 


It  is  convenient  to  consider  two  regimes  of  operation  when  u(t)  *  0.  One  is  the  none  self- 
oscillatory  regime,  and  the  other  is  the  self-oscillatory  regime.  We  consider  next  the  regime  when 
the  supply  voltage  E  is  slightly  less  than  the  breakdown  voltage  v  so  that  the  nonlinear  element  is 
normally  in  extinguished  state  and  the  limit-cycle  oscillations  are  not  triggered  spontaneously.  The 
subthreshold  value  of  E  in  this  case  is  such  that  the  addition  of  the  driving  signal  is  sufficient  to 
uigger  the  limit-cycle  oscillation  whenever  the  voltage  across  the  nonlinear  element  exceeds 
threshold.  Thus,  in  the  presence  of  u(t)  and  specially  when  u(t)  is  sufficiently  slowly  varying,  that 
is  the  scale  of  its  time-variations  is  large  compared  to  the  natural  oscillation  period  T  of  the  circuit 
we  discussed  earlier,  the  effect  of  u(t)  on  the  circuit  can  be  shown  to  be  reproduced  by  suitably 
changing  the  supply  voltage  E.  In  this  regime,  our  bifurcating  neuron  exhibits  nearly  sigmoidal 
dependence  of  firing  frequency  on  activation  i.e  ,  on  the  effective  voltage  across  the  nonlinear 
element. 


To  show  this  we  proceed  as  follows:  Because  the  spike  width  T2  is  very  narrow,  we 
approximate  the  spike  portion  abed  of  the  limit-cycle  oscillation  of  the  S-shaped  nonlinearity  in 
Fig.  3  or  Fig.  2  by, 


(5) 


i  =  <t>  (v)  =  0  (v') 


v'  <  VI 

V  >  VJ 


Since, 


v'  =  V  -  u 


we  have. 


(6) 


Vl=Vth-U  Vth  =  Vl-l-U 

V2  =  Vex  -  u  ^  Vex  =  V2  +  u. 


The  spike  amplitude  i©  in  eq.  (5)  is  determined  by  considering  the  change  AQ  in  the  charge  stored 
in  the  capacitor  C  because  of  a  single  firing  of  the  neuron,  i.e.,  the  discharge  of  the  capacitor 
during  the  abed  portion  of  a  limit-cycle  oscillation.  This  is. 
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AQ  =  C  Av 


or, 

Jidt  =  C(vth  -  Vex)  =  C(vi  -  V2) 


or  by  using  eq.  (5) 


Jio5(t)  dt  =  C(vi  -  V2) 


(7)  io  =  C(vi  -  V2). 


which  is  seen  to  be  determined  by  the  capacitance  C  and  the  parameters  vl  and  v2  of  the  nonlinear 
S-shaped  nonlinearity. 


Now  when  v  <  ,  <J>(v  -  u)  =  0  ,  eq.  (1)  reduces  to. 


_  dv  E  -  V 
'-dt  “  R 

whose  solution  is, 

.  t 

(8)  v(t)  =  E  -  (E  -  V2  -  v(o))  e  RC 


which  represents  the  capacitor  voltage  build-up  in  the  d-^  a  region  of  the  limit-cycle  trajectory  in 
Fig.  2. 

The  interspike  interval  T  is  found  from  eq.  (8)  by  letting  t  =  T  and  v(T)  =  vtji(T)  =  vj  -t- 
u(T).  This  yields. 


(9) 


T  = 


RCin 


E  -  V2  -  u(o) 
E  -  VI  -  u(T) 


RC.^n 


^  ~  ^ex(o) 
E  -  Vth(T) 
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This  analytical  model  is  known  as  the  integrate  and  fire  model  of  the  spiking  neuron. 

We  see  from  comparing  eqs.  (9)  and  (6)  that  the  effect  of  the  activation  potential  (the 
neuron’s  driving  signal)  u  is  to  horizontally  shift  the  curve  4>(v  -  u)  in  Fig.  2  to  the  left  or  to  the 
right  by  an  amount  that  depends  on  the  magnitude  and  sign  of  u.  Because  the  load  line  is 
stationary,  this  horizontal  shifting  causes  a  migration  of  the  operating  point  A  on  the  <()  curve. 


When  u  is  positive  and  is  increasing,  <()(v  -  u)  would  be  shifted  to  the  right  until  v^h  >  E  at 

which  instant  the  operating  point  A  enters  segment  I  of  the  ()>  curve  and  limit-cycle  oscillations  are 
halted  or  extinguished  in  agreement  with  the  prediction  of  eq.  (9)  when  Vjjj  >  E.  This  is 
analogous  to  complete  inhibition  of  the  firing  of  a  biological  neuron  when  a  sufficient  inhibitory 
activation  potential  u  is  present.  Thus  a  positive  u  in  our  neuron  model  corresponds  to 
inhibitory  signal. 


When  u  is  negative  and  is  gradually  decreased,  0(v  -  u)  shifts  gradually  to  the  left.  This 
has  the  effect  of  increasing  the  firing  frequency.  A  negative  u  in  our  neuron  circuit  (Fig.  1(b))  is 
therefore  equivalent  to  an  exitatory  input.  As  the  value  of  u  is  made  i  re  negative  (increased 

excitation)  the  shift  of  <()(v')  to  the  left  continues  until  the  operating  point  A  leaves  the  negative 
resistance  segment  II  and  enters  the  positive  resistance  segment  III  of  the  (j)  curve.  When  this 

happens,  the  nonlinear  element  ()>  remains  on  (conducting  state)  and  limit'Cycle  oscillations  are 
again  extinguished.  Limit-cycle  oscillations  can  not  be  triggered  then  no  matter  how  strong  an 
exitatory  signal  is  received.  This  has  the  effect  of  limiting  the  maximum  possible  firing  frequency 
and  is  analogous  to  the  limiting  of  the  maximum  firing  frequency  in  the  biological  neuron  by  the 
presence  of  an  absolute  refractory  period. 

We  examine  next  two  cases  of  u(t).  In  one  case  u(t)  =  -V^j  where  Vj^  is  a  positive  real 
constant  whose  value  is  changed  gradually  to  determine  how  T  and  hence  the  firing  frequency  f  = 

^  changes.  This  case  leads  to  conventional  sigmoidal  response  with  an  upper  limit  on  the  firing 

frequency.  In  the  second  case  u(t)  is  assumed  to  be  periodic.  It  leads  to  complex  firing  modalities 
of  ^e  neuron  and  includes  bifurcation  between  them  depending  on  parameters  of  the  periodic 
activation. 


Case  I 

Let  u(t)  =  -  Vjn.  then  eq.  (9)  becomes. 


UO) 


T  =  RC£n 


E  -  V2  ->•  Vm 

E  -  VI  +  Vm 


and  ())(v  -  u)  =  <})(v  +  Vjjj).  Thus  a  positive  increasing  Vjji  corresponds  to  increased  inhibition  and 
a  lowering  of  the  firing  frequency  f  =  ^ ,  while  a  negative  decreasing  Vm  corresponds  to  increased 
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excitation.  The  value  of  the  maximum  firing  frequency  fmax  value  of  Vj^  at  which  the 

maximum  firing  frequency  occurs  are  determined  fiom  Fig.  2  in  the  following  manner; 


When  <t>  shifts  to  the  left  so  that  the  load  line  passes  through  point  C  we  have, 

c3  E/R 
E  -  Vex  "  E 

or, 

c? _ ^ 

E  -  V2  -  Vm  -  R 


which  yields  the  maximum  value  of  for  which  firing  stops. 


(11)  Vm  =E-v2-cdR 

max 


by  combining  eqs.  (8)  and  (7)  one  obtains  an  expression  for  the  maximum  firing  frequency  of  our 
neuron. 


(12) 

where 


f. 


max 


^mui 


(13) 


^min 


=  RC/n 


2E  -  2v2  -  ^R 
2E  -  (vj  +  V2)  -  ^R 


Thus  the  circuit  in  Fig,  1  (b)  is  seen  to  have,  when  u(t)  =  -  Vjjj,  an  upper  limit  on  the  firing 
frequency  determined  by  the  value  of  Vm.  The  upper  bound  on  firing  frequency  is  imposed  by  the 
nonlinear  element  remaining  in  the  on  state  which  stops  limit-cycle  oscillations.  In  contrast  the 
maximum  firing  frequency  of  a  biological  neuron  is  imposed  by  the  absolute  refractory  period 
immediately  following  each  action  potential  (spike  firing)  during  which  the  neuron  is  incapable  of 
firing  again  no  matter  how  strong  an  exitary  stimulus  it  receives.  In  limiting  the  maximum  firing 
frequency,  refractoriness  in  the  living  neuron  helps  conserve  energy  and  eliminate  reverbrations. 

The  dependence  of  firing  frequency  f  =  ^  on  Vm  where  T  is  given  by  eq.  (10)  is  shown  in 

Fig.  5  (solid  line  curve  marked  Tj.  =  0)  together  with  the  measured  firing  frequency  (dotted  line 

curve).  The  two  curves  are  for  the  case  when  the  S-shaped  nonlinearity  (j)  is  that  of  a  glow-lamp. 
The  circuit  parameters  and  glow-lamp  param«  'ers  are  given  in  the  figure  caption.  Verification  of 
eq.  (12)  for  an  example  of  glow-lamp  S-shap  ..  nonlinearity  with  following  parameters; 
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yields. 


VI  =  141  [V] 
C  =  10-7  [F] 


V2  =  125  [V] 
^  =  4  10-4  [A] 


E  =  160  [V] 
R  =  105  [Q] 


7'min  7.6  10"^  [sec] 


and  therefore 


^max  “T  .  -  1^1  [1^^] 

^min 

which  is  in  agreement  with  the  experimental  cutoff  frequency  observed  in  the  preceding  figure. 
Figure  5  also  shows  the  effect  of  arbitrarily  adding  an  absolute  refractory  interval  Tj-  [msec]  to  the 
right-hand  side  of  eq.  (10).  Such  ad  hoc  inclusion  of  Tj.  is  seen  to  accelerate  saturation  of  the 
firing  frequency. 

Case  II 

The  preceding  analysis  shows  basically  the  bifurcating  model  neuron  can  exhibit  usual 
sigmoidal  response.  We  show  next  however,  that  when  the  activation  potential  u(t)  is  periodic, 
the  neiffon  alters  its  behavior  and  is  able  to  phase-lock  its  firing  to  the  frequency  of  the  periodic 
activation  or  a  sub-harmonic  of  it,  or  can  fire  quasiperiodically,  erratically,  or  in  bursts,  all 
depending  on  the  amplitude  and  frequency  of  the  activation  potential.  Periodic  activation  at  the 
neuron's  hillock  is  assumed  to  arise  whenever  the  spike  trains  (action  potentials)  incident  on  the 
neuron's  dendritic-tree  are  correlated.  If  we  refer  to  the  aggregate  of  all  spike  trains  incident,  at 
any  time,  on  the  dendritric-tree  as  the  incident  spike  wav^ront,  then  a  coherent  incident  spike 
wavefront  produces  a  periodic  activation  potentid,  i.e.,  a  periodic  driving  signal  for  the  neuron. 
Thus  we  examine  now  the  behavior  of  the  circuit  of  Fig.  1  fb)  when  the  activation  potential  u(t)  is 
periodic.  The  main  result  of  the  analysis  is  a  Phase-Transition  Map  (PTM)  which  relates  the  phase 

®n+l  spike  produced  by  the  neuron  to  6^,  the  phase  of  the  n-th  (preceding)  spike.  In 

our  formulation,  the  phase  of  a  spike  is  always  measured  relative  to  the  immediately  preceding 
peak  (or  some  other  selected  feature)  of  the  periodic  activation  potential.  The  PTM  is  a  nonlinear 

iterative  map  on  the  (0  -  2k)  interval,  and  as  such,  it  lends  itself  to  further  analysis  and  treatment  as 
is  usually  done  in  nonlinear  dynamics  with  other  maps  of  the  interval  onto  itself  like  the  logistic 
and  the  circle  map  for  example. 

For  simplicity  we  assume  the  periodic  activation  or  driving  signal  of  the  neuron  (essentially 
the  membrane  potential  at  the  hillock)  is  cosinusoidal  of  amplitude  a,  radian  frequency  (O5  and  fixed 
phase  ©o,  i.e., 

u(t)  =  a  cos((Dgt  -1-  Qq) 
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Fig.  5.  DqieiideiKx  of  Firing  frequency  on  activation  potential  Vm.  Example  of  glow-lamp  S-shaped  nonUneaiity 
with  the  foUowing  fixed  parameters:  VI  »  141  [volts]  V2  =  125  [volts],  RC  *  0.01  [sec.],  E  »  160  [volts],  (solid) 
computed,  (•  •  •)  measured. 


Then  starting  from  the  expression  for  the  capacitor  voltage  in  Fig.  1(b)  we  derived  earlier  (eq.  8), 


t 

(14)  v(t)  =  E-(E-V2-u(o))e‘RC 


and  by  referring  to  Fig.  6  we  see  that 


(15)  v(t)  =  v^(t)  at  t  =  T 
where 

(16)  v^(t)  =  Vj  +  a  cos(to^t  +  0q) 


Therefore  from  eqs.  (14)  and  (15), 


(17) 


T 

E  -  Vj  -  a  cos  (to^T  +  0^)  =  (E  -  V2  -  a  cos0)  e  RC 
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\  Relation  between  the  periodic  driving  signal  (activation  poteniiaJ'i  appearing  in  V,h(t),  the  capacitor  voltage  V(t),  and  the  spike 
rn  produced  by  the  neuron  (only  two  spikes  are  shown)  referred  to  in  the  accompanying  analysis.  Note  the  relation  between  Bn, 


(18) 


“cT  =  -  0n 

s  n+1  n 


0'  .  ,  -  0. 


and  letting 


®  n+ 1  “  ®  n+ 1  m  =  integer 


we  have, 


®n+l  “  ®  n+1  '  n+1^  mod.27c 


Therefore  from  eqs.  (14)  and  (17), 


E  -  Vj  -  a  cos0’jj^j  =  (E  -  V2  - 


0'  ,  -  0 
^  i-n±J — ^ 

(nJRC 

,)c  ' 


or,  in  terms  of  normalized  variables. 


(1  -  v’j  -  a' cos0’^^j)  e  ^  =  (1  -  v'2  -  a' cos0^)  e 


where. 


v'j=Vj/E  ,  v'2  =  V2/E  ,  a’  =  a/E 


X  =  0)^RC 
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Therefore  finally  frcan  (24)  and  (22), 


(25) 


and 


+  x/n  ( — 
1  - 


-  v’-  -  a'  cos0^ 
- 2 —  ..  n — ^ 

V,  -  a'  cose'„^, 


®n+l  ■'•®'n+l'mod.2l 

Equations  (25)  and  (26)  are  the  main  results  of  this  analysis.  Taken  together  they  form  the 
relationship  or  mapping  between  0^,  the  phase  of  the  n-th  spike  and  0^^^.  the  phase  of  the 
(n+l)th  spike  in  the  firing  activity  of  the  neuron  under  cosinusoidal  activation.  Note  the  phase  of  a 
spike  is  always  measured  from  the  immediately  preceding  peak  of  the  cosinusoidal  drive  signal 
(see  Fig.  6).  Equations  (25)  and  (26)  can  be  expressed  symbolically  in  the  form 


»„.i =«<«„) 

where  the  function  g(.)  is  defined  by  the  mapping  in  eqs.  (25)  and  (26).  Note  that  eq.  (25)  is  a 
transcendental  equation  in  0^^^^  that  must  be  solved  first  given  0^^,  the  system  parameters,  and 
those  of  the  periodic  driving  signal.  Having  obtained  0’^^^^  its  modulus  2ic  is  computed  to  obtain 
We  call  the  mapping  in  eq.  (27)  Phase-Transition  Map  (PTM).  The  PTM  is  a  nonlinear 

iterative  map  of  the  interval  [0,  2jc]  onto  itself.  Like  other  iterative  maps  of  interval  onto  itself, 
such  as  the  Logistic  Map  and  the  Circle  Map,  the  PTM  can  be  studied  using  the  tools  of  nonlinear 
dynamical  systems. 

For  example.  Fig.  7  shows  the  steps  involved  in  obtaining  a  plot  for  the  PTM  of  the 
bifurcating  neuron  using  eq.  (25)  assuming  a  glow-lamp  S-shaped  nonlinearity  and  a  periodic 

COS 

cosinusoidal  driving  signal  activation  of  amplitude  a=l[V]  and  frequency  f  = —  =  190  [Hz]. 

*  2k 

The  PTM  for  this  case,  shown  in  Fig.  7  (c),  can  be  iterated  graphically  as  illustrated  in  Fig.  8(a). 

Entering  the  abscissa  of  this  figure  from  an  initial  value  0^  one  draws  a  vertical  line  that  intersects 

o 

the  plot  giving  the  value  of  0j^  which  can  be  re-entered  on  the  abscissa  and  the  process  repeated  to 
find  $2  and  so  forth.  This  procedure  is  greatly  simplified  by  using  the  45°  line  =  Ojj  in 
performing  the  iterations.  From  0^  one  moves  vertically  to  intersect  the  PTM  plot,  then  moves 
horizontally  to  intersect  the  45°  line,  at  a  point  whose  abscissa  gives  the  value  of  0p  then  move 
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vertically  to  intersect  the  plot  again  and  move  from  there  horizontally  to  meet  the  45°  line  and 
obtain  62  and  so  fmth.  Starting  from  any  initial  value  6^  the  resulting  sequence  of  phase  values 
6^  ns  1,2, 3...  eventually  settles  into  a  regular  pattern  (orbit)  or  seemingly  an  erratic  one  depending 

on  the  parameters  of  the  plot.  The  values  of  6^  vs.  n  where  n  is  the  iteration  number  is  shown  in 
Fig.  7(b)  for  1000  iterations.  This  plot  clearly  shows  the  firing  pattern  of  the  neuron  for  this  case 
is  quite  complex  but  regular.  Figure  8(c)  shows  the  Lyapunov  exponent  [20], 

(28)  A.=  /im[^Z  lg’(en)l  ] 

N-«o 


where  g'Cfi^)  is  the  derividve  or  slope  of  the  PTM  at  the  iteration  points,  the  values  of  6^  produced 

by  the  map,  and  the  periodically  driven  bifurcating  neuron  it  represents.  The  Lyapunov  exponent 

is  a  measure  of  regularity,  or  lack  of  it,  in  iterative  maps.  A  value  of  X  >  0  is  usually  taken  as  an 

indication  of  irregularity  or  chaos.  Because  the  0^^  pattern  in  Fig.  8(b)  is  regular,  the 

corresponding  Lyiqiunov  exponent  is  seen  to  stabilize  to  zero  after  few  hundred  iterations  when 

transients  die  out.  The  shape  of  the  PTM  changes  markedly  when  the  parameters  (a,  f  )  of  the 

s 

cosinusoidal  driving  signal  are  altered.  This  is  demonstrated  in  the  PTM  plots  and  associated  0^^ 
orbits  which  show  period  three  (left)  and  period  six  (right)  Hring  modalities  shown  in  Fig.  9. 
Again  because  the  0^^  patterns  are  ordered  the  Lyaponov  exponents  of  the  two  plots  are  negative. 
It  is  worth  noting  that  period<N  firing  modality  covers  the  case  when  the  neuron  is  bursting. 

Qualitatively,  similar  results  are  obtained  for  bifurcating  neurons  employing  solid-state 
nonlinear  elements  with  S-shaped  nonlinearity  such  as  the  unijuncdon  transistor  (UJT)  and  the 
programmable  unijunction  transistor  (PUT)  which  are  solid-state  equivalents  of  the  glow  lamp. 

A  more  encompassing  view  of  bifurcating  neuron  dynamics,  is  provided  by  the  bifurcation 
diagram.  This  is  an  intensity  plot  of  the  resulting  phase  orbit  or  sequence  0^^,  after  transients  are 

allowed  to  die  down.  The  values  of  0^^  are  entered  as  points  along  the  vertical  above  each 
frequency  point  as  shown  in  the  example  of  the  measured  bifurcation*  diagram  for  a  programmable 
unijunction  transistor  neuron  (PUTON)  embodiment  of  the  bifurcating  neuron  shown  in  Fig.  10 

for  a  fixed  driving  signal  amplitude  a  =  0.175  [V].  Note  the  values  of  0^  in  the  figure  are  shown 

normalized  to  27c.  The  figure  demonstrates  clearly,  and  at  one  glance,  the  rich  variety  of  firing 
modalities  the  PUTON  goes  through  as  the  frequency  of  periodic  activation  (driving  signal)  is 
altered. 


•Circuit  diagram  of  the  PUTON  and  the  0^  measurement  system  are  not  given  here  but  will  be  included  in  a  future 
publication. 


162 


Phase  Bifurcation  Diagram  of  PUTON 
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Fig.  1 0.  Measured  bifurcation  diagram  for  a  PUTON  embodiment  of  the  bifurcating  neurtxt  for  a 
=  .  175  [V].  Note  the  variety  of  phase-locked  regular  firing  modalities  for  which  only 

one  value  of  0^^  is  produced  at  each  driving  frequency,  separated  by  more  complicated 

and  perhaps  erratic  0^  sequences  occurring  at  other  driving  frequency  values. 


164 


The  information  in  the  bifurcation  diagram  can  also  be  presented  in  a  somewhat  different  format, 
that  of  a  Sawtooth  Phase-lock  diagram  or  its  equivalent  the  Devil's  Staircase  diagram.  Since  0^^  is 
the  relative  phase  between  the  n-th  spike  fired  by  the  neuron  and  the  immediately  preceding  peak  of 
its  cosinusoidal  activation  potential  or  driving  signal,  the  timing  of  the  spikes  can  be  determined 
and  therefore  also  the  interspike  interval  or  its  inverse:  the  instantaneous  Hring  frequency  f.  Plots 

f 

of  the  firing  frequency  f  vs.  driving  frequency  f  or  of  the  phase-lock  ratio  p  =  vs.  f  also 

called  the  rotation  ratio  computed  from  the  0^  sequences  are  shown  in  Fig.  11  for  a  bifurcating 
neuron  assuming  a  glow-lamp  nonlinearity.  Note  both  phase  lock  diagrams  show  presence  of 
driving  signal  frequency  windows  over  which  perfect  phase-locking  with  negligible  standard 
deviation  in  firing  frequency  occur.  These  regions  are  separated  by  regimes  of  more  complex 
firing  nKxlalities  including  quasi-periodic  perhaps  erratic  firing  for  which  the  standard  deviation  of 
the  firing  frequency  is  finite.  The  Devil's  Staircase  diagram  measured  for  a  bifurcating  neuron 
embodiment  employing  a  PUT  nonlinearity  is  shown  in  Fig.  12(a)  with  a  high  resolution 
(expanded)  plot  of  one  of  the  segments  lying  between  two  phase-lock  regions  given  in  Fig.  12(b). 
This  latter  plot  shows  clearly  the  self-similar  or  fractal  nature  of  the  diagram  where  a  staircase 
structure  is  easily  discerned  in  the  mean  and  t^e  mean  plus  and  mean  minus  standard  deviation 
branches  of  the  diagram  .  Note  however  that  tne  finer  rungs  of  the  staircase  appear  to  be  blurred 
by  noise  in  the  PUTON  circuit  used  in  making  the  measurement.  The  transistion  from  a  phase- 
lock  region  where  the  standard  deviation  of  tiring  frequency  (or  rotation  number)  is  negligible,  to  a 
region  where  the  standard  deviation  is  finite  is  seen  to  be  quite  abrupt  attesting  to  the  rapid 
switching  of  behavior  of  the  bifurcating  neuron  as  the  value  of  the  bifurcation  parameter,  f^  in  Ais 
case,  is  altered.  Again  the  plots  of  Figs.  11  and  12  are  for  a  fixed  value  of  driving  signal 
amplitude.  These  plots  can  be  regarded  as  the  phase-lock  frequency  response  of  the  bifurcating 
neuron  at  fixed  driving  signal  amplitude.  Phase-lock  frequency  response  plots  at  different  discrete 
values  of  driving  signal  amplitude  can  be  obtained  in  a  similar  fashion.  The  data  contained  in  such 
a  series  of  plots  can  be  presented  compactly  in  the  form  of  the  Arnold  Tongues'  diagram  shown  in 
Fig.  13.  This  diagram  can  be  interpreted  as  the  frequency  response  of  an  active  nonlinear  device, 
like  the  bifurcating  neuron,  that  is  capable  of  phase-locking  its  firing  to  the  driving  signal.  The 
horizontal  lines  in  this  plot  give  the  width  of  the  periodic  phase-lock  firing  regions  at  each  driving 
signal  amplitude  as  the  frequency  is  swept.  Outside  these  regions,  which  are  shaped  like  wedges 
or  tongues  pointed  downward,  the  firing  can  be  period-N,  quasi-periodic  or  even  erratic.  The 

tongues  represent  phase-lock  regions  with,  starting  from  the  left,  integer  phase-lock  ratio  p  = 
1,2,3,...,.  It  is  noted  the  width  of  these  integer  phase-lock  regions  decreases  as  the  amplitude  of 
the  driving  signal  is  made  smaller. 

In  deriving  the  PTM,  we  assumed  cosinusoidal  periodic  activation  or  driving  signal  of  the 
neuron  to  make  the  analysis  tractable.  We  believe  the  complex  behavior  of  the  bifurcating  model 
neuron  observed  for  cosinusoidal  activation  does  persist  for  arbitrary  periodic  activation. 
Replacing  the  cosinusoidal  driving  signal  in  Fig.  6  by  an  arbitrary  periodic  signal  hints  that  this 
should  be  so  and  preliminary  simulation  results  for  an  I  &  F  neuron  driven  by  a  periodic  signal 
composed  of  several  sinusoidal  components  shows  interspike  sequences  with  similar  complexity 
as  when  it  is  driven  by  a  pure  cosinusoidal  signal.  An  interesting  question  in  this  regard  is 
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Fig.  11 

Computed  phast'lock  frequency  response  of  bifurcating  neuron  employing  glow.lamp  S* 
shaped  nonlinearity.  Driving  cosinusoidal  signal  is  u(t)  =  acos  (2)irst).  Plots  shown  are  for 
a  =  10(V]  and  operating  conditions:  supply  voltage  E  =  160  [V],  glow-lamp  breakdown  and 
extinction  voltages  Vj  =  140(V1,  and  V2  =  100[VJ  respectively,  a)  Sawtooth  phase-lock 
diagram  and  b)  corresponding  staircase  phase-iock  diagram.  In  both  diagrams  the  upper 
branches  are  the  mean  plus  standard  deviation  of  the  firing  frequency  and  the  lower  branches 
are  the  mean  minus  the  standard  deviation  of  the  firing  frequency  In  the  phase-lock  regions 
the  standard  deviation  is  zero.  The  mean  and  standard  deviation  ai.  each  driving  frequency  fg 
are  calculated  from  a  200  msec  record  of  the  firing  activity.  Note  in  both  plots  the  integer 
phase-lock  regions  (where  the  phase-lock  ratio  p  =  fj/f  =  1.2,3  ....)  are  separated  by  regions 
of  irregular  firing  where  the  standard  deviation  in  firing  frequency  is  finite.  These  regions  of 
erratic  firing  furnish  adaptive  "noise"  that  could  play  a  role  in  "annealing"  bifurcating  neuron 
networks  i.e.,  drive  them  into  entrained  (phase-locked)  states. 
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Devil's  Staircase  Diagram  or  PLTON 
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Fig.  12.  Measured  Devil’s  Staircase  diagram  of  the  periodically  driven  PUTON,  a  bifurcating 
neuron  embodiment  employing  a  programmable  unijunction  transistor  (PUT)  furnishing  the  S- 
shaped  membrane  nonlinearity  (a),  and  expanded  region  of  the  diagram  illustrating  the  self-similar 
or  fractal  nature  of  the  Devil's  Staircase  diagram  (b). 
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Arnold  's  Tongue  Diagram  of  PUTON 


Fig.  13.  The  Arnold  Tongue  s  diagram  of  the  PUTON  representing  the  phase-lock  regions  of  the 
periodically  driven  PUTON  (horizontal  bars)  for  selected  driving  signal  amplitude.  If  standard 
deviation  of  the  interspike  interval  is  below  .04  [msj  a  horizontal  bar  is  plotted.  In  the  regions 
between  the  horizontal  bars,  which  form  the  tongues,  the  firing  is  quasi-periodic  or  erratic. 
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whether  the  bifurcating  neuron  phase-locks  its  firing  to  the  strongest  spectral  component  of  the 
arbitrary  periodic  signal  or  not.  This  question  is  under  investigation. 

3.  IMPLICATIONS  AND  SIGNIFICANCE  OF  THE  BIFURCATING 
NEURON  CONCEPT. 

The  results  presented  so  far  show  that  the  model  neuron  we  consider  has  much  more 
complex  behavior  than  simple  sigmoidal  response  employed  widely  in  neural  modeling  today, 
where  the  state  of  the  neuron  is  represented  by  its  firing  frequency,  and  the  firing  frequency  is  a 
nonlinear  (sigmoidal  shaped)  function  of  the  activation  potential.  In  this  mode  of  operation,  and 
when  the  activation  potential  is  above  threshold,  the  neuron  can  be  viewed  as  a  VCO  (voltage 
controlled  oscillator)  with  a  highly  nonlinear  (spike)  output  waveform.  We  have  seen,  however, 
when  the  activation  potential  becomes  periodic,  the  neuron  can  alter  its  behavior  (bifurcate)  to  fire 
in  a  variety  of  phase-locked  regular  or  erratic  firing  modalities  depending  on  the  parameters 
(amplitude  and  frequency)  of  the  activation  potential.  Hence  our  use  of  the  term  bifurcating  neuron. 
Thus  the  bifurcating  neuron  can  act  not  only  as  a  sigmoidal  neuron,  as  when  its  activation  potential 
is  slowly  varying  and  is  not  periodic,  but  also  can  act  as  a  detector/encoder  of  episodes  of  coherent 
incident  spike  wavefronts  at  its  dendritic-tree  that  give  rise  to  periodic  activation. 

The  PTM,  which  is  derived  in  the  form  given  here  for  the  first  time,  provides  a  powerful 
tool  for  studying  the  dynamics  of  realistic  neurons  and  neural  networks  using  the  customary  tools 
of  nonlinear  dynamics  such  as:  bifurcation  diagram,  Lyapanov  exponent,  entropy  of  the  firing 
patterns  or  phase  orbits.  Devil's  staircase  diagram,  and  Arnold  Tongues'  diagram,  all  tools  that 
provide  a  novel  and  useful  approach  to  modeling,  characterizing,  and  better  understanding  of 
cortical  networks,  higher-level  cortical  functions,  and  of  how  to  ultimately  incorporate  feature¬ 
binding  and  cognition  and  other  higher-level  cortical  functions  in  man-made  systems. 

In  the  above  analysis  a  clear  relation  between  a  spike  or  action  potential  fired  by  a  neuron 
and  the  limit-cycle  trajectory  in  the  i-v  plane  of  the  S-shaped  nonlinearity  used  to  represent  the 
neuron's  excitable  membrane  was  established.  It  is  interesting  to  speculate  as  to  why  use  of  limit- 
cycle  oscillations  and  spiking  neurons  has  evolved  in  biological  systems.  We  offer  the  following 
reasons  for  such  use: 

(a)  Stability  of  the  limit-cycle  oscillation,  and  hence  its  robusmess  and  immunity  while  it 
persists  to  noise:  The  shape  of  spikes  fired  by  the  neuron  is  invariant. 

(b)  Rapid  entrainment  and  synchronization  of  systems  possessing  limit-cycle  oscillations  such 
as  the  bifurcating  neuron  which  would  allow  coherent  states  to  evolve  rapidly  within  a 
coupled  population  of  bifurcating  neurons  (bifurcating  neural  network).  A  coherent  or 
synchronized  state  is  manifested  by  fixed  relative-phase  pattern  (vector)  between  the  firings 
of  neurons  in  the  network  and  by  coherent  incident  spike  wavefronts  received  by  each 
neuron  from  other  neurons  in  the  networic. 

(c)  The  relative-phase  vector  of  a  synchronized  network  of  N  neurons  is  defined  by  an  N- 
vector  <(>  with  elements  <j)^  i=l,2,...N  where  0  <  0.  <  2jt  is  the  phase  between  a  spike 
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fired  by  the  i-th  neuron  and  the  immediately  preceding  spike  of  an  arbitrarily  selected 

t; 

reference  neuron.  If  the  i-th  neuron  fires  Xj  [sec]  after  the  reference  neuron,  then  =  j  2n 

where  T  is  the  interspike  interval  of  the  reference  neuron.  By  this  definition,  the  reference 

neuron's  in  phase  would  be  zero.  If,  because  of  measuring  accuracy,  the  phase  of  each 
neuron  can  be  distinguished  over  L  levels  then  the  total  number  of  possible  distinguishable 
permutations  or  states  the  network  can  assume  is  (NL)!.  In  contrast,  a  network  of  N 
binary  neurons  has  a  total  of  2^  possible  states  while  a  network  of  N  sigmoidal  neurons 
whose  outputs  arc  distinguishable  over  L  levels  has  a  total  of  L  possible  states.  If  we 
take  for  the  sake  of  illustration.  N=3  and  L=3.  then  the  bifurcating  neuron  network  can 

3 

have  a  total  of  (3x3)!  =  362,880  possible  states.  This  exceeds  by  far  the  2  =8  states 
possible  had  the  network  been  of  binary  neurons  or  the  3^=27  states  had  it  used  sigmoidal 
neurons.  These  simple  considerations  serve  to  illustrate  that  in  their  making  use  of  phase 
information,  temporal  networks,  possess  far  denser  state-space  available  for  accessing  by 
network  dynamics  than  conventional  sigmoidal  or  binary  neuron  networks.  Accordingly 
one  can  intuitively  expect  that  bifurcating  neural  networks  would  exhibit  richer  state-space 
behavior  than  sigmoidal  networks  or  their  binary  cousins  and  to  be  capable,  in  general,  of 
carrying  out  more  complex  signal  processing  operations  and  computations. 


It  is  interesting  to  note,  in  connection  with  the  above  remarks  that  a  possible  advantage  of 
multilevel  sigmoidal  neuron  networks  demonstrated  in  handling  a  gray-level  image 
processing  application  is  that  the  number  of  neurons  and  number  of  interconnections  are 
reduced  compared  to  binary  neuron  networks  [21]  which  is  an  important  concern  in  VLSI 
implementations  of  neural  network.  In  the  referenced  work  each  neuron's  response  was 
represented  by  a  multilevel  sigmoidal  function  of  L=16  levels.  The  results  suggest  that  the 
smaller  the  slope  of  the  staircase  nonlinearity  for  each  neuron,  i.e.,  the  larger  is  the  value  of 
L,  the  larger  is  the  domain  of  attraction  of  each  desired  equilibrium  point.  The  complexity 
of  VLSI  realization  of  multilevel  threshold  elements  can  neutralize  however  the  advantage 
of  reduced  number  of  neurons  and  interconnections  in  such  networks.  Since  the  number  of 
possible  state  levels  (relative  phase  values)  that  can  be  naturally  and  automatically  assumed 
by  a  neuron  in  a  synchronized  bifurcating  or  spiking  neural  network  can  be  quite  large, 
such  networks  could  possess  a  distinct  advantage,  over  multilevel  threshold  neural 
networks,  in  reducing  the  number  of  neurons  and  interconnections  needed  to  handle 
information  processing  tasks.  This  is  an  interesting  subject  for  further  study  of  the 
practical  advantages  of  bifurcating  neural  networks. 

(d)  If  we  adopt  the  relative-phase  vector,  as  the  state  variable  for  temporal  networks,  it  would 
be  tempting  to  speculate  that  relative-phase  patterns  in  cortical  networks  serve  as  substrates 
for  cognition  and  other  higher-level  cortical  functions.  A  relative-phase  vector  can  be 
represented  as  a  point  in  a  "relative-phase"  state-space  of  the  population  or  network. 
Periodic  changes  in  the  relative-phase  pattern  or  vector  in  such  networks  can  then  be 
represented  by  a  closed  trajectory  or  limit-cycle  in  relative-phase  state-space  of  the  network. 
Similarly  a  chaotic  or  erratic  sequence  of  relative-phase  vectors  can  be  associated  with  a 
chaotic  trajectory  in  the  relative-phase  state-space  of  the  network.  The  relative-phase  state- 
space  of  bifurcating  neural  networks  could  thus  exhibit  point,  periodic,  and  chaotic 
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attractors.  In  this  picture,  chaos  and  the  possibility  of  rapid  bifurcation  between  such 
attractors,  as  induced  by  external  signals,  is  of  interest  as  means  for  rapidly  searching  the 
state-space  of  the  network  for  coherent  states  that  are  meaningful  for  cognition,  feature¬ 
binding,  and  other  kinds  of  higher-level  processing  operations  believed  to  be  carried  out  by 
the  cortex. 

Defining  the  relative-phase  pattern  of  a  synchronized  network  of  bifurcating  or  spiking 
neurons  in  the  above  manner  is  ambiguous  because  of  the  arbitriness  of  choosing  the 
reference  neuron.  To  remove  this  ambiguity  we  recall  that  in  the  analysis  of  Section  2,  the 
relative-phase  of  the  spikes  fired  by  the  individual  neuron  in  a  synchronized  network  was 
defined  relative  to  the  peak,  or  some  other  feature,  e.g.,  zero-crossing,  of  the  periodic 
activation  signal  driving  it.  Thus  the  periodic  activation  signals  produced  at  the  hillock  of 
each  neuron  in  a  synchronized  network  can  furnish  a  natural  self-reference  for  determining 
the  relative  phase  of  spikes  produced  by  each  neuron  and  hence  the  relative-phase 
distribution  of  the  network  at  any  time  without  ambiguity.  Of  course,  the  self-reference 
signals  exist  only  when  the  network  is  in  synchronized  state.  Accordingly  the  relative 
phase  vector  is  G(t)  whose  components  0  <  6-(t)  <  271,  i=l,2,...,N  with  N  being  the 
number  of  neurons  of  the  network,  are  the  relative-phase  vs.  iteration  number,  i.e.,  the 
orbits,  produced  by  the  neurons.  The  relative-phase  vector  0(t)  represents  now  the  state  of 

the  network  unambiguously.  At  any  instant  of  time,  0(t)  describes  the  position  vector  of  a 
point  in  an  N-dimensional  relative-phase  state-space  of  the  network  whose  coordinates 

span  the  [0,  27i]  range. 

Thus  when  the  orbit  or  sequence  0^^  n=l,2,3,...for  each  neuron  is  fixed  i.e.,  0^=const  for 

each  neuron,  the  relative-phase  pattern  of  the  synchronized  network  will  be  fixed  i.e.,  0(t) 
is  constant  and  the  behavior  of  the  network  is  represented  by  a  fixed  point  in  state-space. 

In  this  case  the  network  is  both  phase-locked  and  frequency  locked.  When  0(t)  =  0(t  -k  T) 
i.e.,  the  state  vector  evolution  in  time  is  periodic,  the  behavior  is  represented  by  closed 
state-space  trajectory.  The  network  assumes  the  same  value  of  the  relative-phase  vector 
every  T  seconds.  In  this  condition,  every  neuron  can  be  in  a  period-m  firing  modality  with 
the  ratios  of  the  values  of  m  for  the  various  neurons  being  related  by  integer  numbers.  The 
network  are  then  phase-locked  but  not  frequency  locked. 

A  network  of  bifurcating  neurons  in  which  the  neurons  can  exhibit  quasi-periodic  or  erratic 
firing  for  certain  parameters  of  their  periodic  activation  signals,  could  exhibit  more 
complicated  state-space  trajectories.  Such  a  network  would  contain  neurons  with  erratic 
firing  whose  number  and  identity  can  change  in  time  producing  therefore  quasi-periodic 
and  chaotic  state-space  trajectories  that  could  visit  large  regions  of  the  state  space.  This 
suggests  that  such  chaotic  states  or  trajectories  could  serve  as  means  for  searching  the 
relative-phase  state- space  of  the  network  for  point  or  periodic  attractors  that  could  represent 
meaningful  cognitive  states  or  could,  as  proposed  by  Zak  [22],  represent  higher- level 
cognitive  processes  such  as  formation  of  new  logical  forms  based  upon  generalization  and 
abstraction. 


171 


(e)  Dynamic  Partitioning:  There  is  mounting  evidence  at  present  that  the  signal  processing 
function  of  the  dendritic-tree  of  a  living  neuron  is  not  confined  to  merely  integrating  the 
post  synaptic  potentials  produced  by  its  synaptic  inputs  (incident  spike  wavefront)  in  order 
to  form  the  activation  potential  at  the  neuron's  hillock,  but  could  also  include  more  con^lex 
nonlinear  signal  processing  operations  [23,24].  These  operatiwis  are  believed  to  stem  from 
the  action  of  excitable  membranes  at  some  spines  on  the  dendritic-tree  which  makes  for  an 
active  rather  than  passive  dendritic-tree.  This  could  give  rise,  in  a  coupled  population  of 
such  dendritic-neurons,  to  stimulus  driven  synchronization  and  feature  linking  capabilities 
[25] .  and  stimulus  driven  dynamic  partitioning  of  a  network  into  weakly  interacting 
subpopulations  (15],[26]-[281  that  can  perform  collective  computations  in  parallel  which  is 
significant  for  forming  Non  Lipschitzian  networks  with  unpredictable  dynamics.  It  is 
suggested  [2]  that  Non  Lipschitzian  networks  represent  cortical  networks  better  than 
convennonal  neural  networks  whose  dynamics  obey  the  Lipschitz  condition.  Dendritic-tree 
processing  is  meaningful  in  spiking  neuron  networks  and  therefore  it  is  not  an  issue  in 
sigmoidal  neuron  networks.  Therefore  consideration  of  spatiotemporal  processing 
operations  in  dendritic-trees  does  not  arise  in  sigmoidal  networks. 

(0  The  spiking  nature  of  the  bifurcating  neuron  and  the  complexity  of  phase  c»'bits  (0^^  values 
associated  with  the  spike  trains)  it  produces  under  changing  input  conditions  enables 
viewing  the  bifurcating  neuron  as  an  information  source.  It  also  enables  defining  the 

entropy  and  mutual  informa^Lj  x  of  the  6^^  sequences  or  firing  patterns  produced. 
Specifically,  we  can  view  the  bifurcating  neuron  as  an  information  source  with  sequential 
output  of  symbols  or  events  from  the  set  S  =  {Sj,  S2,  — Sg)  with  each  symbol  occurring 
with  fixed  probability  P(S.)  i=l,2,...,q.  If  the  probability  of  a  symbol  occurring  is 
independent  of  previous  symbols  we  say  the  neuron  is  a  zero-memorv  source.  The 
information  gained  or  received  when  the  i-ih  symbol  or  event  occurs  is  then  by  definition 

[29], 

(29)  KSi)  =  tog^  ^ 

The  average  amount  of  information  received  per  symbol  is 

q 

(30)  <I>  =  Z 

i=l 


q 

(31)  =  -  Z 

i=l 

where  H  is  the  entropy. 


P(S.)  US.) 


P(Sj)  Zog2  P(Sp  =  H 
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We  need  to  be  more  specific  about  what  is  meant  by  symbol  when  we  view  the  bifurcating 
i  as  ai 
n-1.2 . N. 


neuron  as  an  information  source.  Consider  the  plot  6^  vs.  n  shown  in  Fig.  14  where 
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Fig.  14.  Plot  of  typical  6^  vs.  iteration  number  n  referred  to  in  entropy  calculations. 

There  are  N  points  in  the  plot  and  we  assume  N  is  sufficiently  large.  The  6^  axis,  spanning  the  0- 
2ic  interval,  is  divided  into  bins.  Then,  by  counting  how  many  points  fall  in  the  i-th  bin  and 
dividing  the  outcome  by  N,  we  obtain  the  probability  Pj  of  6^^  falling  in  bin  i.  This  is  also  the 
frequency  histogram  of  0^. 

The  entropy,  which  is  a  measure  of  disorder,  is  in  accordance  to  eq.  (31), 

Nb 

H  a  -  £  P.  iog  P. 
i=l 


The  maximum  entropy  or  disorder  occurs  when  the  probability  of  0^  falling  in  the  i-th  bin 

N  N 

(i=l,2,3,...,N.)  is  constant.  The  number  of  events  per  bin  in  this  case  is  —  and  P-  =  (77-)  x 

b  \  ^  S 

<N>=  n;- 
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1  =  ( -  —  )  log  — 


X  Njj  =  log  Njj 
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Therefore  the  normalized  Entn^y 


(32) 


H 

u 

max 


1 


I  P.  log  Pj 


provides  a  normalized  measure  of  the  degree  of  disorder  in  the  orbits  6^  and  therefore  in  spike 
trains  produced  by  the  bifurcating  neuron.  An  =  1  indicates  chaotic  firing.  The  computed 

normalized  entropy  of  the  6^  sequence  in  Fig.  14  for  example,  assuming  =  100  bins  is  s 
.922. 


We  have  seen  in  the  above  that  the  bifurcating  neuron  can  be  viewed  as  an  information 
source  producing  symbols  0^  0=1,2,...  with  symbol  probability  Because  the  orbit  6^ 

depends  on  the  neuron's  input  (activation  potential),  the  symbol  probability  changes  as  the  input  to 
the  neuron,  or  the  incident  spike  wavefront  giving  rise  to  it,  changes.  Different  inputs  pr^uce 
therefore  different  symbols  and  corresponding  symbol  probabilities.  The  bifurcating  neuron  can 
therefore  also  be  viewed  as  a  complex  encoder  of  incident  spike  wavefronts. 

The  above  observations  may  enable  speculating  on  possible  learning  in  bifurcating  neural 
networit  in  terms  of  reducing  the  Cross-Entropy, 


(33) 


N 

G  =  I  P(e^)  log 
n=l 


p(e„) 


where  is  the  symbol  probability  produced  by  the  neuron  for  certain  input  (incident  spike 

wavefront)  and  P'fQjj)  is  a  target  symbol  probability  we  wish  the  neuron  to  produce  for  that  input. 
The  learning  task  then  is  how  to  modify  the  synaptic  responses  (synaptic  weights  and  possibly 
time  constants  or  synaptic  delays)  of  the  neuron  so  that  G  is  minimized.  Note  that  G  ->  o  when 

P’(0n)  **  determine  die  synaptic  responses  of  neurons  in  a  population 

or  pool  of  interconnected  bifurcating  neurons  in  which  a  portion  of  the  neurons  receive  external 
inputs  such  that  given  input  patterns  which  can  be  spatial  or  spatio-temporal,  end  up  producing 
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desired  distinct  relative  phase  patterns  when  the  network  gets  synchronized.  The  learning 
algorithm  developed  must  obey  the  usual  minimal  perturbation  principle,  i.e.,  reoccurrence  of  input 
patterns  that  have  already  been  learned  would  not  alter  the  synaptic  weights  and  only  novel  inputs 
would  produce  synaptic  response  changes  that  minimally  perturb  the  information  already  stared. 

(g)  Although  in  preceding  sections  we  refer  to  erratic  firing  of  our  bifurcating  model  neurons 
under  certain  conditions  of  periodic  activation,  the  Lyapunov  exponents  associated  with  values  of 
a  and  f^  in  the  Arnold  Tongues'  diagram  falling  between  the  phase-lock  regions  have  never  been 
found  to  be  positive  which  preclues  explicit  chaotic  firing.  The  seemingly  erratic  firing  regions  in 
the  bifurcation  diagram  would  therefore  be  representing  period-m  firing  with  long  periods,  i.e., 

large  m  or  equivalently  firing  modalities  where  a  long  sequence  of  non-repeating  values  of  0^ 

occurs  before  the  value  of  0^^  is  repeated.  This  observation  is  supponed  by  the  Poincarc- 
Bendixon  theorem  [30]  which  says  that  a  system  whose  nonlinear  dynamics  are  governed  by  two 
autonomous  first-order  ordinary  differential  equations  containing  nonlinear  coupling  terms  cannot 
exhibit  chaotic  behavior  (see  also  pages  2-5  of  reference  20).  The  behavior  of  the  cosinusoidally 
driven  bifurcating  neuron  model  analyzed  in  Section  2  is  governed  by  two  differential  equations: 


(34) 

and 


dt 


E  -  V 
R 


<t>(v  -  u) 


(35) 


(0 


where  u  =  a  costp  and  <p  =  cot  +  0^,  0^  being  a  constant  and  action  potentials  are  produced 
whenever  v  reaches  threshold  (see  eq.  5).  According  to  the  Poincare  Bendixon  theorem  this 
system  of  equations  cannot  exhibit  chaotic  solutions  i.e.,  chaotic  sequences  of  firing  phases  0^^.  It 
is  capable  of  producing  as  we  stated  long  sequences  of  period-N  that  appear  erratic  to  an  observer 
if  N  is  very  large  but  are  not  strictly  chaotic.  Therefore  in  order  to  exhibit  chaos,  our  bifurcating 
neuron  model  must  be  modified  in  such  a  way  as  to  make  its  dynamics  governed  by  at  least  three 
autonomous  first-order  differential  equations  with  nonlinear  coupling  terms.  This  could  occur 

when,  for  example,  the  threshold  level  v^  of  the  S-shaped  nonlinearity  i  =  <t)(v)  is  not  constant,  as 
assumed  in  the  analysis  of  Section  2,  but  obeys  a  differential  equation  of  its  own  like, 

(36)  ^  ‘ 


where  a  is  a  constant  and  f(v)  is  a  nonlinear  function  of  v.  Equation  (36)  adds  adaptive 
thresholding  or  accommodation  to  our  bifurcating  neuron  model  making  it  resemble  more 
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biological  neurons  which  are  known  to  exhibit  accommodation.  It  is  worth  noting  that  the 
Hodgkin-Huxley  model  of  the  biological  membrane  (18]  and  the  MacGregor  and  Oliver  version  of 
it  [31]  consist  of  N_>  3  autonomous  first-order  differential  equations  with  nonlinear  coupling 
terms.  They  are  expected  therefore  to  exhibit  chaotic  solutions  for  certain  regions  of  their 
parameter  spaces.  It  would  seem  therefore  that  under  suitable  conditions  of  periodic  activation, 
biological  neurons  in  the  cortex  would  exhibit  chaotic  firing.  Indeed  chaotic  firing  regimes  have 
been  observed  experimentally  in  the  periodically  driven  biological  membrane  (the  squid’s  giant 
axon)  and  the  Molluscan  neuron  [33]-[36]. 

Inclusion  of  adaptive  thresholding  or  accommodation  in  the  analysis  of  the  bifurcating 
model  neuron  described  in  this  paper,  although  analytically  challenging,  is  very  well  likely  to  lead 
to  complex  firing  modalities  under  periodic  activation  that  include  full  blown  chaotic  firing.  One 
may  well  ask  at  this  point  why  is  chaotic  firing  important?  The  following  is  an  attempt  to  answer 
this  question. 

Chaos  describes  a  strange  intermediate  state  lying  between  rigid  organization,  i.e.,  order, 
and  complete  disorganization,  i.e.,  disorder  or  randomness.  It  also  connotes  something  between 
predictability  and  chance,  between  a  deterministic  signal  and  noise. 

Chaos  in  neurodynamics  could  have  beneficial  consequences  because  decision  making 
processes  that  show  chaos  mix  consistency  with  unpredictibility  and  could  overcome  the 
limitations  of  both. 

Understanding  chaos  and  learning  to  use  it  in  the  design  of  in  man-made  systems  in  general 
and  in  neurodynamics  in  particular  could  be  the  key  for  increasing  the  power  of  neurocomputing 
by  extending  their  capabilities  to  higher-level  processing  like  cognition,  feature  and  concept 
binding,  inferencing,  reasoning,  attention  focusing,  and  possibly  improved  learning  and 
optimization. 

Possible  roles  for  chaos  are: 

•  Efficient  search  of  state-space  of  a  network  that  is  neither  systematic  or  random,  but  is 
driven  by  the  dynamics  of  the  network  itself. 

•  As  inherent  (self-generated)  adaptive  (signal  dependent)  source  of  "noise"  in  a  network, 
chaos  could  possibly  furnish  a  mechanism  for  "self-annealing"  the  network  into  phase- 
locked  states  of  "energy"  minima  that  are  cognitively  meaningful. 

4.  CONCLUSIONS.  The  study  of  bifurcating  and  spiking  neural  networks  and  the 
roles  of  synchronicity,  bifurcation  and  chaos  in  such  networks  is  still  in  its  infancy.  It  is  however 
rapidly  growing  because  of  its  promises  to  provide  better  understanding  of  higher-level  cortical 
functions  and  of  how  to  incorporate  them  in  artificial  neural  networks  to  enhance  the  power  of 
neurocomputing.  Recent  developments  [251,(37]  are  examples  of  this  trend.  The  study  of  the 
dynamics  of  the  bifurcating  model  neuron  presented  here  is  a  step  towards  understanding  how  the 
functional  complexity  of  the  individual  processing  element  governs  the  dynamics  of  bifurcating 
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neural  networks  specially  when  they  enter  synchronized  states  and  towards  learning  how  to  use 
synchronicity  bifurcation  and  chaos  in  the  design  of  a  new  generadon  of  computing  structures. 
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Abstract.  We  discuss  the  application  of  front  tracking  to  the  simulation  of  shock  reflec¬ 
tions  and  shock  accelerated  interfaces.  Some  key  features  of  the  front  tracking  method  are 
the  elimination  of  numerical  diffusion  and  the  reduction  of  wall  heating.  In  computations  of 
the  regular  Mach  reflection  of  a  shock  at  an  oblique  ramp,  we  see  enhanced  resolution  of  the 
primary  waves  in  the  interaction.  In  addition,  tracking  allows  very  precise  measurements  to 
be  made  of  the  states  and  location  of  the  Mach  triple  point.  Our  computations  of  the  growth 
rate  of  a  Richtmyer-Meshkov  unstable  interface  are  the  first  numerical  results  that  are  in 
quantitative  agreement  with  experimental  results  of  a  shocked  air-SFe  interface.  Previous 
attempts  to  model  the  growth  rate  of  the  instability  have  produced  values  that  are  almost 
twice  that  of  the  experimental  measurements.  Moreover,  the  failure  of  the  impulsive  model,, 
and  the  linear  theory  from  which  it  is  derived,  to  model  experiments  correctly  is  understood 
in  terms  of  time  limits  on  the  validity  of  the  linear  model. 


Keywords:  Front  tracking,  Mach  refection,  Richtmyer-Meshkov 

1.  Introduction 

In  this  article  we  present  results  of  simulations  using  front  tracking  combined  with  a 
second  order  Godunov  finite  difference  method.  Two  classes  of  problems  axe  discussed,  the 
oblique  reflection  of  shock  waves  at  ramps,  and  the  computation  of  the  instability  growth 
rate  of  a  perturbed,  shock-accelerated  interface.  Our  code  achieves  excellent  resolution 
of  the  simulated  flows,  even  on  the  relatively  coarse  grids  used  here.  In  both  cases  our 
computed  results  are  shown  to  be  in  excellent  agreement  with  experiments.  Indeed,  we 
present  computations  of  the  Richtmyer-Meshkov  instability  that  for  the  first  time  agree  with 
experimentally  measured  growth  rates  of  interface  perturbations. 

The  ramp  reflection  simulations  model  the  interaction  of  a  planar  shock  wave  with  an 
oblique  wall.  We  are  interested  in  determining  the  structure  of  the  reflection  process  for  ramp 
angles  that  are  very  close  to  the  mechanical  equilibrium  condition  for  bifurcation  to  regular 
reflection,  as  defined  by  the  coincidence  of  regular  and  Mach  reflection.  The  use  of  front 
tracking  allows  us  to  conduct  numerical  experiments  that  are  extremely  close  to  this  point. 
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Ordinary  shock  capturing  methods  are  unable  to  resolve  the  Mach  triple  point  configuration 
in  this  regime,  due  to  the  extreme  closeness  of  this  point  to  the  wall.  We  measure  several 
quantities  for  the  reflection  process,  including  the  trajectory  of  the  triple  point,  and  the 
Mach  number  of  the  flow  behind  the  foot  of  the  Mach  stem.  Comparison  of  these  quantities 
to  experiment  shows  a  good  agreement  between  our  values  and  the  experimentally  measured 
quantities. 

The  Richtmyer-Meshkov  instability  concerns  the  growth  of  interface  perturbations  on  a 
shock  accelerated  material  interface.  When  a  shock  wave  collides  with  the  interface  between 
two  different  materials,  small  perturbations  of  this  interface  grow  into  nonlinear  structures 
having  the  form  of  “bubbles”  and  “spikes”.  The  occurrence  of  this  shock-induced  instabil¬ 
ity  was  predicted  by  Richtmyer  [15]  and  confirmed  experimentally  by  Meshkov  [12].  The 
Richtmyer-Meshkov  instability  is  similar  to  the  more  familiar  Rayleigh-Taylor  instability 
and  is  important  in  both  natural  phenomena  (supernovae)  and  technological  applications 
(inertial  confinement  fusion). 

Theory  and  computation  have  so  far  failed  to  provide  an  understanding  of  the  Richtmyer- 
Meshkov  instability  that  is  in  quantitative  agreement  with  existing  experiments  [3,  4,  6, 
14,  16).  Computations  of  the  Richtmyer-Meshkov  instability  for  singly  shocked,  sinusoidally 
perturbed  interfaces  have  over- predicted  growth  rates  by  factors  from  40%  to  100%  [6]  as 
compared  to  experiments.  The  main  theoretical  model  used  in  this  area,  Richtmyer’s  impul¬ 
sive  model  [15],  also  consistently  predicts  a  growth  rate  that  is  too  large. 

Our  computations  of  the  Richtmyer-Meshkov  instability  are  further  validated  by  a  compar¬ 
ison  of  small  amplitude  perturbation,  early  time  simulations  with  solutions  to  a  linearization 
of  the  equations  of  motion.  An  analysis  of  the  time  intfval  of  validity  for  the  linearized 
model  explains  the  failure  of  the  linearized  and  impulsive  models  to  agree  with  experiment. 

2.  The  Front  Tracking  Method 

Front  tracking  is  a  computational  method  for  the  sharp  resolution  of  a  set  of  distinguished 
waves  in  a  flow.  It  combines  a  standard,  rectangular  grid  based  finite  difference  method  with 
a  set  of  lower  dimensional,  dynamically  moving  grids  that  follow  the  tracked  wave  fronts. 
A  general  description  of  this  method,  including  an  outline  of  the  structure  of  our  computer 
program,  is  given  in  [10]. 

The  numerical  solution  for  flows  in  two  space  dimensions  is  represented  on  the  union  of 
a  rectangular  grid  and  a  set  of  piecewise  linear  curves.  The  state  at  each  point  on  the 
rectangular  grid  represents  the  cell  average  over  the  corresponding  cell  of  the  dual  grid 
centered  at  that  point.  The  solution  at  a  point  on  a  tracked  front  is  multivalued,  with 
values  corresponding  to  the  limits  of  the  solution  on  either  side  of  the  wave.  The  numerical 
representation  of  the  flow  explicitly  includes  the  jump  discontinuities  across  the  tracked 
waves  and  thus  eliminates  numerical  diffusion. 

Points  of  intersection  between  tracked  waves,  called  nodes,  correspond  to  two-dimensional 
interactions  between  wave  fronts.  An  important  example  of  such  a  node  in  these  computa¬ 
tions  is  the  Mach  triple  point. 

A  global  solution  operator  for  the  evaluation  of  the  state  of  the  flow  at  arbitrary  locations 
is  constructed  from  a  front-limited  triangulation  of  the  computational  grid  and  the  tracked 
fronts.  This  triangulation  is  constrained  so  that  no  triangle  crosses  a  tracked  front.  A  side  of 
an  individual  triangle  in  this  construction  is  either  a  rectangulax  lattice  cell  boundary,  or  an 
edge  on  a  tracked  front.  A  corner  of  such  a  triangle  is  thus  either  a  grid  cell  corner  or  a  point 
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on  a  tracked  front  or  an  intersection  of  a  tracked  front  with  a  lattice  cell  boundary.  The 
states  at  these  points  serve  as  data  for  a  linear  interpolant  of  the  solution  into  the  interior 
of  the  triangle. 

The  representation  of  the  solution  in  our  front  tracking  code  differs  from  the  more  standard 
triangular  representations  of  a  flow  in  that  the  tracked  waves  are  dynamic  and  move  with 
time  so  that  the  triangulation  must  be  regenerated  at  each  time  step  in  the  computation.  The 
method  also  differs  from  the  unstructured  finite  volume  techniques  in  that  the  main  solution 
is  computed  using  a  regular,  rectangular  grid.  Subsequently  the  states  at  grid  points  that 
lie  within  the  domain  of  influence  of  the  tracked  waves  over  the  time  step  must  be  corrected 
to  account  for  the  presence  of  these  waves.  It  is  important  to  note  that  the  front  tracking 
code  combines  both  front  tracking  with  shock  capturing,  so  that  secondary  waves  (such  as 
breaking  shocks  or  induced  slip  lines)  are  resolved  within  the  ability  of  the  underlying  mesh 
spacing  and  the  finite  difference  scheme. 

The  propagation  operator  that  updates  the  numerical  solution  over  a  single  time  step 
consists  of  three  basic  parts:  propagation  of  the  tracked  wave  fronts  (point  propagation), 
propagation  of  points  of  Interaction  between  tracked  waves  (node  propagation),  and  update 
of  the  states  on  the  rectangular  grid  (interior  solver).  For  the  latter  operation,  the  fronts  at 
the  beginning  and  end  of  the  time  step  serve  as  internal  boundaries  for  the  regwns  adj^^ing 
those  waves. 

The  first  propagation  phase  consists  of  the  propagation  of  the  non-nodal  points  on  the 
tracked  waves.  At  each  tracked  point  a  local  rotation  of  coordinates  is  performed  that  aligns 
the  coordinate  axes  with  the  normal  and  tangential  directions  of  the  curve  at  that  point. 
The  tangent  to  a  piecewise  linear  curve  at  a  point  is  defined  as  the  line  through  that  point 
which  is  parallel  to  the  secant  through  the  neighboring  points.  Operator  splitting  is  used  to 
divide  the  front  propagation  into  two  one-dimensional  units:  a  normal  propagation  step  and 
a  tangential  propagation  step.  The  normal  propagation  of  the  tracked  waves  is  computed 
using  a  second  order  Riemann  problem- type  method  as  described  in  [5].  This  operator  solves 
a  piecewise  linear  Cauchy  problem  and  is  similar  to  the  van  Leer  [17]  flux  computation  as  used 
in  the  second  order  Godunov  method.  The  tangential  operator  uses  a  one-dimensi(Hia)  finite 
difference  method,  which  in  our  code  is  precisely  the  same  as  the  interior  finite  difference 
scheme.  The  tangential  stencil  at  a  given  point  on  the  partially  propagated  wave  is  formed 
by  projecting  the  adjacent  curve  states  onto  the  tangent  at  that  point.  The  interested  reader 
is  referred  to  references  [5,  10]  for  a  more  detailed  description  of  these  operations. 

An  interaction  between  tracked  waves  is  locally  approximated  by  a  two  dimensional  Rie¬ 
mann  problem,  which  is  defined  as  the  Cauchy  problem  for  initial  data  that  is  scale  invariant 
with  respect  to  the  node  position  at  the  start  of  the  time  step,  i.e.  constant  along  rays  from 
the  node  position.  The  numerical  solution  is  computed  using  shock  polar  analysis.  Refer¬ 
ences  [8,  9]  contain  a  description  of  the  node  propagation  algorithm.  The  primary  nodes  of 
interest  in  the  present  calculations  are  the  Mach  triple  point  and  the  diffraction  node  formed 
during  the  Richtmyer- Meshkov  calculation  by  the  intersection  of  a  tracked  shock  wave  with 
the  material  interface. 

The  interior  finite  difference  scheme  that  computes  the  solution  on  the  rectangular  grid  is 
an  operator  split,  second  order  MUSCL  scheme  [2,  7].  Our  implementation  uses  a  five  point 
stencil  with  linear  reconstruction. 

The  rectangular  grid  update  consists  of  two  passes:  a  regular  and  an  irregular  grid  update. 
First,  the  finite  difference  equations  are  solved  for  the  the  rectangular  grid  alone,  ignoring 
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Regular  Reflection  Mach  Reflection 


Figure  1 .  a  schematic  of  regular  and  Mach  reflection.  Here,  x  i^he  Mach  node  tra¬ 
jectory  angle,  and  0„,  is  the  ramp  angle.  The  incident  shock  is  moving  from  the  left  to  the 
right. 

the  tracked  fronts.  The  second  pass  then  updates  the  states  at  the  rectangular  grid  points 
near  the  tracked  fronts.  If  a  tracked  front  crosses  the  finite  difference  stencil  of  a  rectangtilar 
grid  point  during  the  time  step,  the  states  computed  at  that  location  by  the  first  interior 
sweep  must  be  discarded  and  recomputed  to  account  for  the  presence  of  the  tracked  wave. 
In  our  implementation  we  use  a  locally  modified  stencil  at  each  such  point  to  compute  the 
updated  solution.  We  start  with  copies  of  the  states  on  a  five  point  stencil  centered  at  the 
given  location.  We  then  find  the  tracked  fronts,  if  any,  on  either  side  of  the  stencil  center 
that  are  closest  to  the  middle  location.  We  replace  any  states  in  the  stencil  lying  on  the 
opposite  side  of  the  nearest  front  by  copies  of  the  state  on  the  correct  side  of  the  front  at 
the  stencil  crossing,  as  computed  by  linear  interpolation  between  the  tracked  points  on  the 
curve.  Thus  the  final  solution  never  uses  finite  differencing  across  tracked  waves. 

The  organization  of  the  interior  solver  into  two  passes  allows  the  bulk  of  the  code  for  this 
part  of  the  computation  to  be  vectorized.  Note  that  even  though  our  computations  used 
a  nonvector  machine  (Sun  Sparcstation?),  this  organization  provides  a  substsmtiad  improve¬ 
ment  in  performance  on  vector  machines  and  adds  little  additionad  overhead  on  nonvector 
computers. 


3.  Shock  Wave  Reflection  at  Ramps 

The  wadi  reflection  problems  consisted  of  a  shock  wave  in  aur  (modeled  ais  a  perfect  gats 
with  7  =  1.4)  colliding  with  an  oblique  ramp.  The  gas  athead  of  the  shock  wave  was 
at  atn  atmbient  pressure  of  0.3  bairs  amd  a  temperature  of  300**  K,  with  am  incident  shock 
Match  number  of  2.0.  Two  interactions  were  modeled  with  ratmp  amgles  0t„  of  46®  amd  49® 
respectively.  A  computationad  grid  of  256  x  256  zones  wats  used  for  both  simulations,  which 
were  conducted  on  a  Sun  Microsystems  Sparcstation2  with  64  megabytes  of  RAM.  The  46® 
run  required  approximately  16  CPU  hours  to  complete,  while  the  49®  required  about  20 
CPU  hours.  The  difference  in  CPU  time  is  primarily  due  to  the  smaller  vadue  of  At  required 
in  the  second  run.  We  note  that  our  investigations  have  shown  that  a  much  coarser  grid 
would  resolve  the  basic  structure,  amd  much  of  the  interior  structure  of  the  reflected  shock 
bubble.  We  have  gotten  satisfactory  restdts  for  the  main  front  locations  on  grids  ais  coarse 
as  100  X  100. 

Our  computation  is  initiadized  just  after  the  shock  crosses  the  ratmp  comer,  when  the 
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(a) 


(b) 


Figure  2.  (a)  Density  contours  from  the  first  of  the  runs,  using  =  46®,  po  =  0.3  bars, 
and  To  =  300®  K.  The  Mach  triple  point  trajectory  angle  x  =  2.557®.  (b)  The  tracked  wave 
fronts  from  the  same  computation. 

reflected  wave  bubble  is  just  a  few  mesh  blocks  in  height.  Tracking  the  complete  reflected 
shock  bubble  requires  an  estimate  of  the  initial  geometry  of  this  object  to  serve  as  a  seed 
for  the  final  computed  configuration.  We  use  the  following  technique  to  install  the  initial 
reflected  waves.  We  assume  the  Mach  stem  is  initially  normal  to  the  ramp.  The  Mach 
trajectory  angle  x  is  then  determined  algebraically  by  the  condition  that  the  turning  angle 
through  the  incident  and  reflected  waves  in  a  frame  that  moves  with  the  Mach  triple  point  is 
the  same  as  the  turning  angle  through  the  Mach  stem,  i.e.  the  flow  behind  the  configuration 
must  be  parallel  to  the  slip  line  on  both  sides.  The  data  for  this  system  of  equations  consists 
of  the  ramp  angle,  and  the  incident  shock  data  (ahead  and  behind  states).  This  system  is 
solved  using  an  iteration  on  x,  which  determines  the  local  states  and  wave  angles  about  the 
triple  point.  The  Mach  stem  and  contact  are  installed  as  straight  line  segments  from  the 
triple  point  to  the  wall,  at  the  computed  angles.  The  position  of  the  bow  of  the  reflected 
shock  behind  the  ramp  is  found  by  solving  a  head-on  reflection  Riemann  problem  for  the  state 
behind  the  incident  shock.  The  shape  of  the  initiad  reflected  wave  is  composed  of  two  pieces. 
The  first  is  a  straight  piece  at  the  triple  point  tangent  to  the  computed  reflected  wave,  and 
the  second  an  ellipse  from  the  end  of  this  segment  to  the  bow  point,  with  axis  on  the  ramp 
and  center  at  the  corner.  It  is  importeint  to  note  that  this  construction  is  only  performed 
once  at  the  beginning  of  the  computation.  The  subsequent  propagation  of  the  Mach  triple 
point  and  the  bow  node  uses  only  local  information  about  those  points.  In  particular  there 
is  no  restriction  that  the  Mach  stem  remain  straight,  or  that  the  reflected  wave  have  any 
particular  shape.  These  properties  are  determined  dynamically  by  the  computation.  In  fact, 
the  shapes  of  the  waves  at  later  times  appear  to  be  independent  of  any  reasonable  initial 
configuration.  Figure  1  shows  a  schematic  of  the  basic  geometry  of  the  reflection. 

Figures  2  and  3  show  the  results  of  our  computations  of  the  two  simulations  outlined  above. 
Figures  2a  and  3a  show  density  contours,  while  figures  2b  and  3b  show  only  the  tracked  wave 
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(a) 


(b) 


Figure  3.  (a)  Density  contours  for  the  second  of  the  runs.  This  computation  differs  from 
the  first  only  in  that  =  49®.  Here  the  Mach  triple  point  trajectory  angle  x  =  1-6®.  (b) 

The  tracked  wave  fronts  from  the  same  computation. 

fronts.  The  density  difference  of  adjacent  contours  is  4%  of  the  density  of  the  unshocked 
2ur.  We  see  very  sharp  resolution  of  the  tracked  waves,  and  in  particular  at  the  t^'iple  point. 
This  resolution  becomes  even  more  important  as  we  move  closer  to  the  bifurcation  point 
to  regular  reflection,  which  we  will  discuss  more  fully  in  the  context  of  the  next  series  of 
simulations.  Let  us  just  say  here  that  our  code  supports  the  von  Neumann  criterion  over  the 
detachment  criterion  as  the  location  of  this  point,  and  these  two  runs  appear  to  beax  that 
out,  since  =  53.36®,  and  =  50.898®.  We  are  relatively  close  to  0e  in  Figure  3,  but 
still  have  a  distinct  Mach  reflection  structure  with  a  sizeable  Mach  stem. 

There  is  a  considerable  amount  of  activity  in  the  flow  at  the  point  where  the  slip  line 
induced  from  the  Mach  triple  point  reaches  the  wall.  The  slip  boxmdary  conditions  at  the 
wall  require  that  the  flow  there  be  parallel  to  it,  and  hence  there  is  a  laxge  gradient  in  the 
velocity  as  the  flow  adjusts  to  the  obstacle.  Tracking  the  slip  line  reduces  the  amount  of 
numerical  spre2uling  of  this  wave  at  the  wall,  which  in  turn  enhances  the  resolution  of  the 
flow  about  the  triple  point  by  preventing  its  contamination  by  the  transient  waves  produced 
at  the  boundary. 

The  density  contours  of  figures  2a  and  3a  show  that  our  computation  is  doing  a  good 
job  of  reducing  the  effect  of  wall  heating  at  the  ramp  boundary.  Most  of  the  contours  axe 
relatively  smooth  going  into  the  boundary,  which  is  consistent  with  the  inviscid  model  used 
for  these  computations.  We  made  no  attempt  to  model  the  boundary  layer  effects  that  are 
present  in  a  real  experiment. 

Another  feature  of  our  code  is  the  ability  to  measure  vaxious  physical  quantities  very 
precisely  with  respect  to  our  computations.  For  example,  we  know  the  exact  states  around 
the  triple  point  or  at  the  base  of  the  Mach  stem  by  simply  reading  them  from  the  curve  data 
structures  in  the  computation. 

In  a  separate  seri^  of  runs,  we  validated  our  computations  of  regular  Mach  reflections 
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Figure  4.  a  schematic  representation  of  the  experimental  apparatus  used  to  produce 
Mach  reflections.  This  configuration  produces  a  pair  of  symmetric  reflections  as  the  incoming 
shock  passes  the  apex  of  the  wedge. 

by  comparing  them  with  experiments  performed  at  the  University  of  Sydney  by  Henderson 
and  Virgona  [11].  Considerable  effort  was  made  to  simplify  the  experiments  in  order  to 
facilitate  comparison  with  the  numerical  results.  For  example,  the  Mach  reflections  were 
generated  in  argon  (7  =  1,667)  so  as  to  eliminate  vibrational  non-equilibrium,  dissociation, 
and  chemical  reactions.  The  strength  of  the  incident  shock  i  was  sufficiently  large  to  ensure 
that  the  flow  downstream  of  the  reflected  shock  r  was  supersonic,  but  not  so  strong  as  to 
ionize  the  argon.  More  precisely,  the  average  strength  of  i  used  in  the  experiments,  or  rather 
the  average  inverse  strength  =  Po/pi  was  =  0.1534,  corresponding  to  an  incident  Mach 
number  of  2.327.  The  ahead  state  pressure  and  temperature  were  po  =  14.1  ±  3.0  kPa 
and  To  —  293.15  ±  4.0*  K.  The  Mach  reflections  were  generated  by  diffracting  incident 
shocks  over  a  series  of  symmetrical  wedges  of  different  apex  semi-angles  0^,  (Figure  4).  This 
design  eliminated  shock-boimdary  layer  interaction  at  the  apex  of  every  wedge.  The  more 
conventional  concave  comer  model  (Figure  1)  undergoes  significant  shock-boundary  layer 
interaction  as  the  reflected  shock  r  sweeps  over  the  lower  wall.  The  newer  model  eliminates 
this  effect,  and  is  mathematically  equivalent  to  the  model  in  Figure  1  in  the  inviscid  case. 
Considerable  development  work  was  done  on  the  design  of  the  boimdary  layer  spill  slots  on 
the  side  walls  of  the  shock  tube.  The  objective  was  to  minimize,  as  far  as  practical,  the 
shock  boundary  layer  on  the  side  walls  of  the  tube. 

A  graph  of  Mach  triple  point  trajectory,  x,  versus  the  ramp  angle,  is  shown  in  Figure  5a. 
These  figures  show  the  experimentally  measured  value  of  x  together  with  the  values  computed 
by  our  front  tracking  code,  as  well  as  those  computed  by  Colella  [11]  using  a  highly  resolved 
shock  capturing  scheme.  This  measurement  was  attractive  because  x  cannot  be  computed 
from  the  shock  polar  analysis  used  to  compute  the  local  configuration  at  the  node.  Its  value 
is  entirely  a  result  of  the  interaction  of  the  numerics  in  our  code,  and  its  experimental  value 
is  particularly  robust.  We  have  also  found  the  Mach  number  behind  the  base  of  the  Mach 
stem  to  be  a  very  useful  measurement,  for  exactly  the  same  reasons. 

As  can  be  seen  from  the  picture,  our  results  agree  very  well  with  both  the  experimental 
and  the  shock  capturing  results.  We  note  that  both  sets  of  computational  results  are  on  the 
high  side  of  the  experimental  values.  This  difference  has  been  attributed  to  boundary  layer 
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Figure  5.  a  comparison  of  the  Mach  triple  point  trajectory  for  experiments,  fine  grid 
shock  capturing  computations,  and  front  tracking.  We  note  that  both  numerical  methods  are 
in  substantial  agreement  with  each  other,  and  are  close  to  the  experimental  measurements. 

Front  tracking  provides  approximately  the  same  value  for  x  ^  ^he  shock  capturing  code, 
using  only  a  fraction  of  the  grid  resolution. 

effects. 

An  application  of  important  interest  is  the  transition  conditions  between  regular  and 
Mach  reflection  at  a  wall.  It  is  well  known  that  for  certain  flow  regimes  there  is  an  overlap 
between  the  regions  in  phase  space  where  regular  and  Mach  reflections  are  possible.  Both 
experimental  and  computational  investigations  have  shown  that  the  boundary  layer  at  the 
wall  plays  an  important  role  in  the  process  that  selects  the  type  of  wave  produced  by  the 
wall  reflection.  In  order  to  quantify  the  effect  of  the  wall  boundary  layer  it  is  important  to 
understeind  the  inviscid  limit  of  the  solution  where  this  boundary  layer  is  absent.  It  is  in 
the  computation  of  this  inviscid  limit  that  a  major  strength  of  the  front  tracking  method  is 
revealed.  Since  we  cire  explicitly  tracking  the  most  singular  parts  of  the  calculation,  we  can 
make  very  precise  statements  about  exactly  where  a  given  discontinuity  is  located;  there  is  no 
numerical  diffusion  of  the  fronts.  This  allows  us  to  perform  computations  near  the  transition 
to  regular  reflection,  yielding  structure  that  is  not  resolvable  in  either  experiment  or  standard 
shock  capturing  codes.  We  can  resolve  the  the  full  Mach  triple  point  configuration  for  angles 
X  as  small  as  0.1°  (Figure  5b).  In  such  simulations  the  Mach  stem  is  less  than  a  grid  block 
long.  By  contrast,  shock  capturing  codes  generally  lose  the  resolution  of  the  Mach  triple  point 
when  the  length  of  Mach  stem  is  less  that  two  or  three  grid  blocks.  This  loss  of  resolution  is 
due  to  the  presence  of  a  numerical  boundary  layer  at  the  wall.  This  regime  is  also  difficult 
to  approach  experimentally  due  to  the  real  viscous  boundary  layer  at  wall.  Currently,  front 
trticking  appears  to  be  the  only  method  that  can  conduct  numerical  simulations  of  inviscid 
wall  reflections  to  within  a  small  fraction  of  a  degree  of  the  mechanical  equilibrium  condition. 

The  resolution  for  the  front  tracking  runs  was  achieved  on  grids  which  are  much  coarser 
than  those  used  in  standard  finite  difference  simulations  of  this  problem.  Most  of  our  grids 
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Figure  6.  A  schematic  representation  of  the  geometry  of  the  Richtmyer-Meshkov  insta¬ 
bility  modeled  in  this  paper.  The  interaction  consists  of  the  collision  of  a  shock  wave  with 
a  material  interface.  The  refraction  of  the  shock  by  the  interface  produces  reflected  and 
transmitted  waves.  The  instability  consists  of  the  growth  of  perturbations  of  the  material 
interface  with  time. 

were  100  x  100,  up  to  about  150  x  150  close  to  the  transition  point.  This  also  gives  a 
commensurate  savings  in  time  -  the  runs  took  between  one  and  four  hours. 

It  should  also  be  pointed  out  that  for  the  region  between  9e  and  9s,  our  code  can  simulate 
either  regular  or  Mach  reflection  -  both  are  theoretically  possible  in  this  region.  However, 
based  on  the  shape  of  the  curve  outside  this  region  (see  Figure  5b),  our  results  definitely 
seem  to  converge  to  the  point  9s,  and  there  is  no  reason  to  expect  a  discontinuity  in  the 
curve  at  9^-  We  feel  that  this  is  a  very  strong  statement  that  in  this  pairameter  regime, 
the  bifurcation  from  Mach  Reflection  to  regular  reflection  takes  place  at  the  mechanical 
equilibrium  condition  and  not  at  the  detachment  point. 

4.  Numerical  Simulation  of  the  Rjchtmyer-Meshkov  Instability 

We  focus  on  the  simplest  case  of  the  shock  tube  experiments  of  the  Richtmyer-Meshkov 
instability  where  a  sine  shaped  material  interface  is  accelerated  by  a  single  shock  wave,  as 
in  the  experiments  of  Meshkov  [12],  Benjamin  [3,  4],  and  others.  The  general  configuration 
of  the  computation  and  experiments  is  shown  in  Fig.  6.  A  thin  membrane  was  used  in  the 
experiments  to  separate  the  two  gases  at  the  materiail  interfaw:e.  Quantitative  agreement  was 
achieved  between  our  computational  results  and  the  experimental  measurements  of  Ben¬ 
jamin  [4]  for  the  rate  of  growth  of  a  shocked  air-SFe  interface.  The  collision  results  in  a 
transmitted  shock  and  a  reflected  wave  that  can  be  either  a  shock  or  a  rarefaction  depending 
on  the  values  of  the  fluid  parameters.  The  experiments  considered  in  this  paper  are  of  the 
reflected  shock  type.  Viscosity  and  heat  conduction  are  negligible  here,  so  the  fluid  motion 
is  described  by  the  Euler  equations. 

The  impulsive  model  proposed  by  Richtmyer  [15]  is  commonly  used  to  estimate  the  growth 
rate  of  a  shock  accelerated  interface.  This  model  is  derived  by  assuming  that  the  shock 
acceleration  can  be  treated  as  being  impulsive,  and  that  the  flow  is  nearly  incompressible 
once  the  shock  wave  has  passed  through  the  materiail  interface.  It  is  also  assumed  that  the 
flow  is  observed  in  a  frame  where  the  average  position  of  the  material  interface  is  at  rest,  and 
the  position,  t/(x,  t),  of  the  material  interface  at  time  t  can  be  given  by  y{x,  t)  =  a{t)  sin  kx, 
where  k  is  the  wave  number  of  the  perturbation.  Richtmyer’s  formula  gives  the  growth  rate 
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of  a(0  as 

a(<)  =  itAu^i^^a(0+),  (4.1) 

Pl+  P7 

where  Am  is  the  difference  between  the  shocked  and  unshocked  mean  interface  velocities,  the 
Pi  are  the  post-shocked  densities  on  the  two  sides  of  the  interface  (the  incident  shock  moves 
from  material  “2”  to  material  “1”),  and  a(O-l-)  is  the  perturbation  amplitude  immediately 
after  the  collision  of  the  shock  with  the  material  interface.  This  formula  implicitly  assumes 
the  initial  preshocked  amplitude,  a(0— ),  is  small  compared  to  the  wavelength. 

Given  that  a(0— )  is  small  so  that  ka{Q—)  1,  a  more  exact  calculation  of  the  ampli¬ 
tude  growth  rate  can  be  made.  The  Euler  equations  are  linearized  around  the  solution  of  a 
one  dimensional  Riemann  problem  defined  by  the  head-on  collision  of  a  planar  shock  with 
a  zero  amplitude  (planar)  material  interface,  using  the  initial  amplitude  of  the  sinusoidal 
perturbation  as  a  small  expansion  parameter.  The  result  of  the  linearization  is  a  system 
of  partial  differential  equations  in  one  spatial  dimension  with  associated  boundary  condi¬ 
tions.  This  system  can  be  solved  numerically  for  the  growth  rate  of  the  perturbed  interface. 
This  approach,  following  Richtmyer  [15],  has  recently  been  generalized  to  include  reflected 
rarefactions  as  well  as  reflected  shocks  [18].  Simple  order  of  magnitude  estimates  limit  the 
validity  of  the  linearized  equations  to  the  dimensionless  time  interval 

t.nun  =  A:a(0-)  <  t.  <  l/[fca(0-)]  =  t.max-  (4.2) 

Here  the  dimensionless  time  t,  =  kcoMot,  where  Mo  is  the  incident  shock  Mach  number,  and 
Co  is  the  sound  speed  of  the  fluid  ahead  of  the  incident  shock.  The  limits  t,min  and  t.m.v 
represent  respectively  the  transit  time  of  the  incident  shock  through  the  perturbed  interface 
and  the  time  required  for  the  perturbation  to  grow  to  unit  amplitude.  Necessarily,  these  time 
limits  apply  to  the  derivation  of  the  impulsive  model  as  well,  since  it  is  an  approximation 
to  the  linear  theory.  Recent  systematic  compau-isons  of  the  impulsive  model  and  the  linear 
theory  have  revealed  both  regions  of  agreement  and  of  disagreement  in  parameter  space  [18]. 

We  compared  our  simulations  of  a  singly  shocked  air-SFe  interface  to  the  experiments  of 
Benjamin  [4].  The  material  interface  is  accelerated  by  a  shock  wave  with  Mach  number  1.2 
moving  from  air  into  SFe-  The  initial  amplitude,  a(0— ),  was  0.00637  times  the  period  of  the 
sinusoidal  perturbation.  For  these  experiments,  t.max  ^  2.5,  while  the  observational  time 
interval  is  15  <  ^.observational  ^  50-  The  observational  times  and  the  validity  of  the  linear 
theory  fail  to  overlap  by  a  factor  of  about  6.  We  conclude  that  the  linear  theory  has  no 
relationship  to  this  experiment. 

Fig.  7  shows  plots  of  the  amplitude  and  amplitude  growth  rate  of  the  material  interface  as 
obtained  from  experiment,  the  front  tracking  simulation,  the  linearized  theory,  and  Richt- 
myer’s  impulsive  model.  The  time  axis  in  these  figures  is  shifted  so  that  <  =  0  corresponds 
to  the  time  at  which  the  shock  wave  has  completed  its  refraction  through  the  interface. 

As  can  be  seen  from  these  figures  the  front  tracking  results  axe  in  substantial  agreement 
with  the  experimental  results  in  the  sense  that  the  growth  rate  derived  by  a  least  squares 
analysis  of  the  amplitude  data,  8.14  m/s,  is  within  the  experimental  range  of  7.9  m/s  ± 
10%.  Note  that  for  late  (i.e.,  experimentally  observed)  times  the  linearized  theory  and  the 
impulsive  model  growth  rates  are  a  factor  of  two  larger  than  those  found  in  experiment  or 
in  our  simulation.  This  may  be  due  to  the  faw^t  that  this  particular  configuration  has  a 
relatively  large  initial  amplitude  and  quickly  leaves  the  region  of  validity  of  the  linearized 
theory  and  impulsive  model.  The  displacement  of  the  experimental  curve  with  respect  to 
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Figure  7.  Perturbation  amplitude,  a{t),  and  amplitude  growth  rate,  d(t),  of  a  shocked 
air-SFs  interface.  This  graph  compares  the  results  of  experimental  averages,  front  tracking 
simulation,  linear  theory  and  Richtmyer’s  impulsive  model.  Also  shown  are  results  of  a  least 
squares  fit  to  the  front  tracking  amplitude  data  over  the  period  of  experimental  observation. 

The  plus  marks  (+)  show  the  results  of  one  particular  experiment,  while  the  experimental 
growth  rate  represents  an  average  over  several  experiments. 

the  front  tracking  curve  is  possibly  due  to  membrane  effects,  i.e.  the  material  strength  of 
the  membrane  or  the  influence  of  its  fragmentation  may  effect  the  fluid  flow. 

The  front  tracking  results  indicate  a  decay  in  amplitude  growth  rates  while  Benjamin  [4] 
finds  a  fairly  constant  growth  rate  during  the  measurement  period.  Other  experiments, 
however,  have  shown  a  decaying  growth  rate  [13,  1].  The  figures  shown  in  this  paper  used  a 
resolution  of  125  zones  per  wavelength,  and  mesh  refinement  studies  in  the  range  of  125-208 
zones  per  wavelength  showed  very  little  change  in  the  amplitude  growth  rate.  We  also  tested 
our  simulation  against  changes  in  other  numerical  parameters  and  found  that  the  value  of 
a{t)  was  insensitive  to  these  changes.  We  conclude  that  this  decay  is  a  real  effect  and  not 
due  simply  to  numerical  dissipation  as  has  been  suggested  [4]. 

A  further  validation  of  the  nonlinear  simulations  can  be  accomplished  by  comparison  to 
the  small  amplitude  theory  (Fig.  8).  This  serves  both  to  determine  the  range  of  validity  of 
the  linear  theory  and  to  validate  the  solution  of  the  full  Euler  equations  at  small  amplitudes. 
As  can  be  seen  in  Fig.  8,  the  front  tracking  calculation  is  converging  to  the  linear  result  as  we 
reduce  the  amplitude.  We  note  that  the  interval  of  convergence  of  the  nonlinear  simulations 
to  the  linear  theory  appears  to  be  finite.  This  is  in  contrast  to  formula  4.2  which  suggests 
that  the  domain  of  validity  of  the  linearized  equations  should  increase  with  decreasing  initial 
amplitude.  This  point  deserves  further  study. 

Of  interest  is  the  question  of  why  our  results  agree  with  experiment  while  results  found 
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Normalized  Time 


Figure  8.  The  convergence  of  the  nonlinear  simulations  to  the  linearized  solution  for 
small  amplitudes.  A  comparison  of  three  separate  calculations  of  the  normalized  pertur- 
bation  growth  rate,  d(t)/[i:coMoa(0— )],  of  a  shocked  air-SFe  interface  with  three  different 
initial  amplitudes  where  k  is  the  wave  number,  cq  is  the  sound  speed  ahead  of  the  incident 
shock,  and  Mq  is  the  incident  shock  Mach  number.  The  horizontal  axis  is  in  dimensionless 
time  units  kcoMot. 

through  other  numerical  methods  do  not.  Prior  disagreement  between  the  growth  rates  mea¬ 
sured  in  experiments  and  those  predicted  by  numerical  simulation  has  led  to  the  suggestion 
that  mass  diffusion  and  membrane  effects  may  have  an  important  role  in  the  behavior  of  the 
interface  instabilities.  Our  work  does  not  exclude  this  possibility,  but  the  agreement  of  our 
computations  with  experiment  suggests  that  a  proper  numerical  resolution  of  the  material 
interface  is  essential  to  obtain  agreement  with  experiment,  and  also  that  if  other  effects  axe 
important,  they  may  be  offsetting  one  another.  It  is  also  clear  that  there  is  still  much  to 
learn  about  the  highly  nonlinear  aspects  of  the  Richtmyer- Meshkov  instability.  These  effects 
include  the  possible  coupling  between  nonlinear  modes,  and  their  study  will  require  exper¬ 
iments  on  singly  shocked  interfaces  aa  well  as  computations  with  random  interfaces  which 
have  been  run  to  late  times.  Similarly,  understanding  the  effects  of  reshocking  remains  an 
important  theoretical  challenge.  For  the  single  mode  case,  a  systematic  study  of  mass  diffu¬ 
sion,  membrane  effects,  and  a  detailed  comparison  to  earlier  calculations  of  others  would  be 
desirable. 
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ABSTRACT.  We  discuss  the  applications  of  high  order  compact  finite  difference  methods 
for  shock  calculations.  Nonlinear  stability  is  achieved  through  the  definition  of  a  local  mean 
which  serves  as  a  reference  for  introducing  a  local  flux  limiting  to  control  spurious  numerical 
oscillations  while  keeping  the  formal  accuracy  of  the  scheme.  For  scalar  conservation  laws,  the 
resulting  schemes  can  be  proven  total  variation  stable  in  one  space  dimension  and  maximum 
norm  stable  in  multi  space  dimensions.  Numerical  examples  are  shown  to  verify  accuracy 
and  stability  of  such  schemes  for  problems  containing  shocks. 


1  Introduction 

Compact  schemes  are  methods  where  the  derivatives  are  approximated  not  by  polynomial 
operators  but  by  rational  function  operators  on  the  discrete  solutions.  We  are  interested  in 
solving  a  hyperbolic  conservation  law 

tit  + /(ti),  +  5(«)»  =  0 

u(x,y,0)  =  u°(x»y)  (1-1) 

using  compact  schemes.  In  the  semi-discrete  form,  a  compact  scheme  for  solving  (1.1)  can 
be  written  as 

^  =  L{u),i  (1,2) 

where  A  and  B  are  both  local,  one  dimensional  operators.  The  subscript  x  or  y  indicates 
that  the  operator  is  applied  in  the  x  or  y  direction. 

As  examples,  a  fourth  order  central  compact  scheme  is  given  by  (1.2)  with 

(Au).  =  ^(vi-i -|-4v,  -|-v,+i) 
o 
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(Bo),  =  -  »i-,) 


(1.3) 


and  two  third  order  upwind  compact  schemes  are  given  by 

(Au)i  = 

{Bv)i  =  ^(3ui +  ri-2)  (1.4) 

and 

(An),  =  -  Ui+i) 

{Bv)i  =  ^(-u.+2  +  4t;,+i  -  3i;,)  (1.5) 

respectively,  depending  upon  the  wind  direction.  Notice  that  (1.4)  and  (1.5)  have  the  same 
implicit  part  A  which  is  symmetric.  This  fact  will  be  used  later  in  Section  2  to  define  our 
local  means. 

The  cost  of  compact  schemes,  regardless  of  the  number  of  space  dimensions,  involves 
only  inversion  of  the  narrowly  banded  (usually  tridiagonal)  matrix  A,  hence  is  comparable 
to  explicit  methods.  This  is  notably  different  from  other  implicit  methods  such  as  the 
continuous  Galerkin  finite  element  methods  in  multi  space  dimensions,  even  if  they  are 
similar  in  one  space  dimension. 

The  advantages  of  compact  schemes  over  traditional  finite  difference  methods  include 
the  relatively  high  order  of  accuracy  using  a  compact  stencil  (for  example,  the  fourth  order 
scheme  (1.3),  when  discretized  in  time  using  Euler  forward,  uses  only  a  three  point  stencil 
in  each  time  level),  a  better  (linear)  stability,  a  better  resolution  for  high  frequency  waves 
[13],  and  usually  fewer  boundary  points  to  handle.  In  recent  years  compact  schemes  have 
attracted  considerable  attention  in  various  fields  such  as  the  direct  numerical  simulations  of 
turbulence.  We  refer  the  readers  to  [13],  [19],  [12],  [3],  [1],  and  [2]  for  more  details.  The 
recent  paper  [13]  discusses  in  detail  wave  resolution,  phase  errors  and  other  issues  related  to 
compact  schemes  and  is  a  good  reference. 

We  are  interested  in  applying  compact  schemes  for  shock  calculations.  Just  like  any 
other  linear  schemes  (schemes  which  are  linear  when  applied  to  linear  equations),  compact 
schemes  usually  demonstrate  nonlinear  instability  when  applied  to  discontinuous  data.  We 
follow  the  TVD  (total  variation  diminishing)  ideas  in  [9],  [14]  and  try  to  define  a  suitable 
nonlinear  local  limiting  to  avoid  spurious  oscillations  while  keeping  the  formal  accur2w:y  of 
the  scheme.  Notice  that  the  compact  scheme,  just  like  any  implicit  scheme,  is  global.  That  is, 
the  approximation  to  f{u)x  at  x  =  involves  u*  along  the  whole  line  due  to  the  tridiagonal 
inversion  A~^.  Our  main  idea  is  to  define  a  local  mean,  and  to  use  it  as  a  reference  for 
introducing  a  local  limiting.  In  Section  2  we  introduce  the  limiting  for  one  space  dimension 
and  obtain  total  variation  stability.  In  Section  3  we  introduce  the  limiting  for  multi  space 
dimensions  and  obtain  maximum  norm  stability.  In  Section  4  we  present  numerical  examples. 
In  most  cases  we  will  only  state  the  theoretical  results  without  proof.  We  refer  the  readers 
to  [6]  for  details. 
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In  this  paper,  we  use  the  total  variation  diminishing  (TVD)  Runge-Kutta  type  time  dis¬ 
cretization,  introduced  in  [17],  [15],  to  discretize  the  ODE  in  the  method-of-lines  formulation 
(1.2).  In  the  third  order  case,  the  time  discretization  is 

=  u^  +  AtL(u’^) 

=  ju"  -f-  -f  (1.6) 

°  5“"  + 1“'”  + 

Only  third  order  results  will  be  shown,  although  schemes  with  other  orders  of  accuracy  are 
also  tested. 

These  special  Runge-Kutta  type  time  discretizations  are  labelled  TVD  because  it  can  be 
proven  that,  under  suitable  restrictions  on  the  time  step  At  (the  CFL  condition),  the  full 
discretization  (1.6)  is  TVD,  or  stable  under  another  norm,  for  example  the  Loo  norm,  if  the 
first  order  Euler  forward  time  discretization  for  (1.2): 

«"+»  =  u"  +  A<L(u")  (1.7) 

is  TVD  or  stable  under  the  other  norm.  For  detiuls,  see  [17]  and  [15]. 

We  thus  only  need  to  consider  the  Euler  forward  scheme  (1.7)  for  ?  ‘Kty  analysis  in  the 
subsequent  sections. 


2  One  Space  Dimension 

In  one  space  dimension,  equation  (1.1)  becomes 


«<  +  /(«)*  =  0 

u(a;,0)  =  u°(x) 

the  scheme  (1.2)  is 

and  the  Euler  forward  time  discretization  (1.7)  becomes 

+  AtL(u’’)i 

Scheme  (2.3)  can  be  easily  written  into  a  conservation  form 

At 


(2.1) 

(2.2) 

(2.3) 


(2.4) 


suitable  for  shock  calculations.  However,  the  numerical  flux  h?  ^  is  not  a  local  function  of 

1+2 

«2  due  to  the  tridiagonal  inversion  A~^.  If  we  define 


u,  =  (/lu), 


(2.5) 
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(2.6) 


then  scheme  (2.3)  can  be  left-multiplied  by  A  to  become 

sr’ = sr  - 

which,  when  written  into  a  conservation  form, 

involves  a  numerical  flux  which  is  a  local  function  of  u^.  For  example, 

/h)  =  5  (/(“«) +  /(“<))  (2-8) 

for  the  fourth  order  central  scheme  (1-3);  and 

/.+t  =  i(3/(u0-/(u.-,))  (2.9) 

and 

=  J  (-/(“«) +3/(ui+.))  (2.10) 

for  the  two  third  order  upwind  schemes  (1.4)  and  (1.5),  respectively.  Notice  that  scheme 
(2.7)  resembles  a  cell-averaged  (finite  volume)  scheme  [11].  The  tij  in  (2.5),  just  like  a  cell 
average,  is  a  local  mean  of  u,  defined  by  Au  in  (1.3)  through  (1.5).  Since  the  computation 
of  the  flux  fi^  in  (2.7)  involves  the  values  of  u,  a  “reconstruction”  from  ti  to  u 

Ui  =  (/l~^ii)i  (2.11) 


is  needed.  This  reconstruction  is  global. 

It  is  now  rather  straightforward  to  define  the  limiting.  We  first  write 


/(u)  =  /+(«) +  /-(«) 


with  the  requirement  that 


drju) 

du 


>0, 


(2.12) 


(2.13) 


The  purpose  of  this  flux  splitting  is  for  easier  upwinding  at  later  stages.  The  simplest  such 
splitting  is  due  to  Lax- Friedrichs 


1 


/  («)  =  o  (/(“)  ±  ««) »  «  =  !/'(«)  I 


(2.14) 


where  the  maximum  is  taken  over  the  range  of  «®(a;).  We  then  write  the  flux  in  (2.7) 
also  as 

(2.15) 

where  are  obtained  by  putting  superscripts  ±  in  (2.8)  through  (2.10). 
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Next  we  define 


=  /-(««) 


(2.16) 


Here  are  the  differences  between  the  numerical  fluxes  and  the  first  order,  upwind 

fluxes  and  /“(tij+i).  These  differences  are  subject  to  limiting  for  nonlinear  stability. 

We  define  the  limiting  by 


■'/i'r*  =  m(a/4i,A+r(8,),A+r(6.-.,)) 

4;'r’  =  (2.17) 


where  A+v,  =  u.+i  —  u,  is  the  usual  forward  difference  operator,  and  the  (now  standard) 
minmod  function  m  is  defined  by 


,r,(n  r,  \  -  I  ^  ™ni<.<it  |aj|,  if  sign{ai)  =  •  •  •  =  sign{ak) 

-  otherwise 

See,  e.g.,  [9].  Notice  that  the  limiting  defined  in  (2.17)  is  upwind  biased. 
The  limited  numerical  fluxes  are  then  defined  by 


=  s 


.^;r'  =  /n«.) + =  r(s.«)  -  <'r' 


and 


■'t+i  A+l  '  ■'•+1 
If  we  define  the  total  variation  of  the  mean  u  by 

ry(u)  =  5:i«.+,-«.| 


(2.18) 


(2.19) 

(2.20) 

(2.21) 


we  have  the  following 

Proposition  2.1:  Scheme  (2.7)  with  the  flux  (2.20)  is  TVDM  (total  variation  diminish¬ 
ing  in  the  means): 

ry  («"+*)  <  ry(u")  (2.22) 

under  the  CFL  condition 


max 

mini  u"<u<maxi  u" 


(r '(«)  -  /-'(»)) 


(2.23) 

□ 


The  limiting  defined  in  (2.17)  is  just  one  of  many  possibilities.  See  [18]  for  a  comprehen¬ 
sive  discussion  of  limiters. 

The  total  variation  stability  for  u  itself  is  based  upon  the  previous  proposition  and  the 
following  lemma: 
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Lemma  2.2:  If  there  are  two  numbers  0  <  6  <  1  and  a  >  0,  which  are  independent  of 
N,  such  that  the  N  x  N  matrix  A  =  (a,,)  satisfies: 

1  ^ 

m«  — :  <  a,  and  |a,_j|  <  6  \ajj\ ,  j  =  1,  •  •  • ,  AT  (2.24) 

i<j<N  |a^j|  .  _  ^ 

(strongly  diagonally  dominance  for  the  transpose  of  A),  then  the  L\  norm  of  A~^  is  bounded 
independent  of  N: 

l|A-‘|U.  <  (2.25) 

□ 


For  most  compact  methods,  the  matrix  A  satisfies  the  condition  (2.24)  for  Lemma  2.2. 
For  example,  in  the  schemes  defined  by  (1.3),  (1.4)  and  (1.5),  A  satisfies  the  condition  (2.24) 
with  6=|,a  =  6;6  =  |,a  =  3  and  6  =  |,  a  =  3,  respectively.  For  such  compact  schemes, 
we  have  the  total  variation  stability  for  u: 

Proposition  2.3:  If  a  compact  scheme  (2.7)  satisfies  the  conditions  in  Proposition  2.1 
and  Lemma  2.2,  then  it  is  TVB  (total  variation  bounded).  That  is. 


rV(u”) 


-  ti. 


<C 


for  all  n  >  0  and  At  >  0.  Here  C  is  a  constant  independent  of  n  and  At. 


(2.26) 


□ 


This  Proposition  guarantees  convergence  of  at  least  a  subsequence  of  the  numerical  so¬ 
lution. 

We  now  discuss  whether  the  limiting  defined  in  (2.17)  maintains  the  formal  accuracy  of 
the  compact  schemes  in  smooth  regions  of  the  solution.  For  this  we  need  the  following 
Assumption  2.4: 

u,  =  {Au)i  =  «,•  +  O(Ax^)  (2.27) 

for  all  u  €  C^. 


□ 


This  Assumption  is  satisfied  by  any  compact  scheme  with  a  symmetric  A,  for  example 
all  those  listed  in  (1.3)  through  (1.5). 

Under  Assumption  2.4,  it  is  easy  to  verify,  by  simple  Taylor  expansions,  that 
A+/*(ti*)  =  f^{ui)j:Ax  +  O(Ax^)  k  =  i  -  l,i,i  +  1 

+  0(Ax»)  (2.28) 

Hence  in  smooth  regions  away  from  critical  points  (critical  points  are  defined  here  as 
points  for  which  /■*■(«)»  =  0  or  /"(«)*  =  0),  the  second  and  third  arguments  of  the  minmod 
functions  in  (2.17)  are  asymptotically  of  the  same  sign  as  the  first  argument  and  half  in 
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magnitude.  Hence  the  first  argument  will  be  picked  by  the  minmod  function  (2.18)  for 
sufficiently  small  Ax,  yielding 

(2») 

This  guarantees  the  original  high  order  accuracy  of  the  scheme  in  such  smooth,  monotone 
regions,  due  to  the  exponential  decay  of  the  off-diagonal  entries  in  A~^  for  the  type  of  A  we 
use  [7].  At  critical  points,  the  accuracy  will  degenerate  to  first  order  as  a  generic  restriction  of 
all  TVD  schemes  (see,  for  example,  [14]).  To  overcome  this  difficulty,  we  use  a  modification 
of  the  minmod  function 


if  |ai|  <  MAx^ 
,ajk),  otherwise 


(2.30) 


where  M  is  a  constant  independent  of  Ax.  This  modification  is  discussed  in  detail  in  [16] 
and  [4]. 

With  this  modification  we  can  obtain  schemes  which  are  formally  of  uniform  high  order 
accuracy,  equalling  the  original  unlimited  scheme,  in  smooth  regions  including  local  extrema. 
Moreover,  we  can  prove  the  following 

Proposition  2.5:  The  conclusions  of  Proposition  2.1  and  2.3  are  still  valid,  for  any  n 
and  At  such  that  0  <  nAt  <  T,  with  TVDM  in  (2.22)  replaced  by  TVBM  (total  variation 
bounded  in  the  means): 

rV(u’‘)  <  C  (2.31) 

where  C  is  independent  of  At,  if  the  minmod  function  m  in  (2.17)  is  replaced  by  the  modified 
minmod  function  m  defined  in  (2.30). 


□ 

The  choice  of  the  constant  M  in  (2.30)  is  related  to  the  second  derivative  of  the  solution 
near  smooth  extrema.  For  details,  see  [16]  and  [4].  The  numerical  result  is  usually  not 
sensitive  to  the  variation  of  M  in  a  large  range. 

3  Multi  Space  Dimensions 

For  notational  simplicity  we  only  consider  the  two  dimensional  case  (1.1)-(1.2).  Three  space 
dimensions  do  not  pose  additional  conceptional  difficulties.  As  before,  we  only  need  to 
consider  the  Euler  forward  time  discretization 

u^;^=u^  +  AtL{u^)ij  (3.1) 

We  again  define 

Uij  =  {AyAxu)ij  (3.2) 

so  that  scheme  (3.1)  can  be  left-multiplied  by  AyA^  to  become 

C’  =  (3.3) 
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Here  and  in  what  follows  we  will  use  the  commutativity  of  Ag,  Ay,  Bg  and  By  so  that  a 

product  can  be  written  in  any  order.  Scheme  (3.3)  can  be  written  into  a  conservation  form 

=  ««  -  (3-<) 

which  involves  numerical  fluxes  and  as  local  functions  of  Uy.  For  example, 

Qij^i  =  ^A,(5(u<j+i)  +  i?(u<,))  (3.5) 

for  the  fourth  order  central  scheme  (1.3),  etc..  Again,  scheme  (3.4)  resembles  a  cell-averaged 
(finite  volume)  scheme  [lOl.  The  u,j  defined  by  (3.2)  is  a  local  mean  of  u,  and  a  “reconstruc¬ 
tion”  from  u  to  u 

“u  =  (3-6) 

is  needed  to  compute  the  fluxes  fi+y  in  (3.4). 

We  remark  that  the  additional  costs  of  implementing  scheme  (3.4),  comparing  with  the 
original  scheme  (3.1),  are  the  two  local  operators  Ax  and  Ay.  The  major  part  of  the  cost  still 
consists  of  the  two  tridiagonal  inversions. 

The  limiting  to  obtain  nonlinear  stability  can  now  be  defined  in  a  dimension  by  dimension 
fashion:  we  can  use  the  one-dimeasional  flux  splitting  (2.12),  for  /(u),  to  write  the  flux 
as 

=  (3-7) 

where  are  again  obtained  by  putting  superscripts  ±  in,  e.g.,  (3.5).  The  remaining 

definition  of  the  limiting  parallels  that  in  Section  2,  with  a  dummy  index  j  added  for  the 
reference  y  value:  We  still  start  with  the  differences  between  the  high  order  numerical  fluxes 
and  the  first  order  upwind  fluxes 


=  =  /-(«<«.) -/-.j,  (3.8) 

limit  them  by 

<‘rJ  = 

3/74”j  =  (3.9) 

where  =  Vj+ij  —  is  the  forward  difference  operator  in  the  x  direction,  and  the 

minmod  function  m  is  defined  by  (2.18).  We  then  obtjun  the  limited  numerical  fluxes  by 


limit  them  by 


itii]  =  + <Ti' 

f(”*)  _  /+("*)  ,  r-(m) 


(3.10) 


(3.11) 


2C2 


The  flux  in  the  y-direction  is  defined  analogously. 

In  light  of  [8]  this  scheme  cannot  be  TVD  in  two  space  dimensions.  However  we  can 
obtain  maximum  norm  stability  through  the  following 

Proposition  3.1:  Scheme  (3.4)  with  the  flux  (3.11)  satisfies  a  maximum  principle  in  the 
means: 

luy-*  I  <  |tir,  I  (3.12) 

under  the  CFL  condition 

[max  (/+'(m))  +  max  (-/"'(«))]  ^  +  [max  (y+'(u))  +  max  (-y"'(u))]  ^  ^  (3.13) 

where  the  maximum  is  taken  in  min,  j  <  «  <  max,  j  u^. 

□ 

In  order  to  obtain  maximum  norm  stability  for  u,  we  need  a  lemma  similar  to  Lemma 

2.2: 

Lemma  3.2: 

If  there  are  two  numbers  0  <  6  <  1  and  a  >  0,  which  are  independent  of  N,  such  that 
the  N  X  N  matrix  A  =  (a,j)  satisfies: 

1  ^ 

^  KI<^k.U  (3.14) 

j  =  1 

3 

(strongly  diagonally  dominance  for  A),  then  the  Loo  norm  of  is  bounded  independent 
of  N: 

ll^l-'IU.  <  (3.15) 

□ 

For  the  compact  methods  we  consider,  the  matrix  A  is  symmetric.  Hence  the  requirements 
(2.24)  and  (3.14)  are  the  same. 

We  can  now  use  Lemma  3.2  to  obtain  the  maximum  norm  stability  for  u: 

Proposition  3.3:  If  a  compact  scheme  (3.4)  satisfies  the  conditions  in  Proposition  3.1 
and  Lemma  3.2  for  both  A,  and  Ay,  then  it  is  stable  in  the  maximum  norm.  That  is, 

n:i«pc|u^|<C7  (3.16) 

for  all  n  >  0  and  At  >  0.  Here  C  is  a  constant  independent  of  n  and  At. 

□ 

This  Proposition  does  not  guarantee  convergence,  but  it  at  least  guarantees  that  the 
numerical  solution  will  not  blow  up  due  to  instability. 

Under  the  Assumption  2.4  for  both  A*  and  Ay,  we  can  again  easily  verify  that  the 
limiting  (3.9)  maintains  formally  the  original  high  order  accuracy  of  the  scheme  in  smooth, 
monotone  regions.  The  degeneracy  of  accuracy  at  critical  points  can  once  again  be  overcome 
by  adopting  the  modified  minmod  function  (2.30)  in  the  limiting  (3.9). 
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4  Numerical  Examples 


To  test  the  behavior  of  the  schemes  discussed  in  Sections  2  and  3,  we  use  the  one  and  two 
dimensional  Burgers  equation  with  smooth  initial  conditions: 


u(x,0) 


0 

0.3  +  0.7  sin(a:) 


(4.1) 


and 


u(x,y,0) 


0 

0.3  +  0.7  sin(x  +  y) 


(4.2) 


both  with  27r-periodic  boundary  conditions.  The  solutions  will  stay  smooth  initially,  then 
develop  shocks  which  move  with  time.  The  exact  solution  to  (4.1)  can  be  obtained  by  follow¬ 
ing  the  characteristics  and  solving  the  resulting  nonlinear  equation  using  Newton  iteration. 
The  exact  solution  to  (4.2)  is  that  of  (4.1)  with  x  replaced  by  x  -f  y  and  t  replaced  by  2t. 
These  are  standard  test  problems  for  scalar  nonlinear  conservation  laws  containing  shocks. 
For  comparison  with  finite  difference  ENO  schemes  find  with  finite  element  discontinuous 
Galerkin  methods,  see  [17],  [4]  and  [5]. 

The  schemes  we  test  are  based  on  the  third  order  upwind  schemes  (1.4)-(1.5)  coupled 
with  the  third  order  TVD  Runge-Kutta  time  discretization  (1.6)  (henceforth  referred  to  as 
the  upwind  scheme).  For  the  flux  splitting  (2.12)  we  use  the  Lax- Friedrichs  splitting  (2.14). 
The  time  step  At"  is  taken  to  satisfy  a  CFL  condition 

At" 

max  |u]*l  -T —  <  0.5  (4.3) 

I  Ax 


in  one  dimension  amd 


-n  I 

i  A? 


<0.5 


in  two  dimensions.  When  the  modified  minmod  limiter  (2.30)  is  used,  the  constant  M  is 
taken  as  1. 

We  first  test  the  effect  of  limiters  when  the  solution  is  smooth  but  not  monotone.  In 
Figure  1  we  plot  the  Li  error  versus  number  of  grid  points,  in  a  log-log  scale,  at  t  =  0.6  for 
the  one  dimensional  case  and  at  t  =  0.3  for  the  two  dimensional  case.  In  such  scales,  the 
error  should  be  a  straight  line  with  slope  —  fc  for  a  fc-th  order  method.  We  can  see  that  the 
original  compact  schemes  and  the  schemes  with  modified  minmod  limiter  (2.30)  (henceforth 
referred  to  as  the  TVB  limiter)  give  the  expected  third  order  accuracy,  while  the  schemes 
with  the  minmod  limiter  (2.18)  (henceforth  referred  to  as  the  TVD  limiter)  give  only  second 
order  £w:curacy  due  to  the  degeneracy  at  the  critical  points. 

We  then  test  the  effect  of  limiters  when  the  solution  becomes  discontinuous.  In  Figure 
2  we  show  the  results  of  the  original  compact  schemes  at  t  =  2  for  the  one  dimensional 
case,  as  well  as  the  result  obtained  with  the  TVB  limiter  (the  result  obtained  with  the  TVD 
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limits  is  gr&phically  similar  to  that  obtained  with  the  TVB  limiter,  hence  is  not  shown). 
We  can  see  over-  and  imder-shoots  for  the  original  compact  scheme,  and  monotone  shock 
transition  for  the  result  obtained  with  TVB  limiter.  In  Figures  3,  we  show  the  pointwise 
errors,  in  a  logarithm  scale,  for  the  numbers  of  grid  points  N  =  10, 20, 40, 80  and  160.  We 
can  see  that  the  error  behaves  as  expected,  with  bigger  errors  for  the  TVD  limiter  near  the 
smooth  extremum  which  is  close  to  the  shock.  The  errors  for  the  two  dimensional  case  are 
similar  and  are  not  displayed.  In  the  last  picture.  Figure  4,  we  show  the  surface  of  the  two 
dimensional  solution  at  t  =  1  with  40  x  40  points  using  the  third  order  upwind  method  with 
TVB  limiting. 
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Figure  2:  Compact  schemes  for  shocks.  Plus  signs:  computed  solution;  solid  line:  exact 
solution. 

2(a):  Third  order  upwind  scheme 


2(b):  Third  order  upwind  scheme  with  TVB  luniter 
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FigiOT  3:  Pointwiae  «ior  for  iV  =  10, 20, «, 80  and  160  grid  points,  in  a  fogarithm  scale. 
Third  order  upwind  scheme  with  limiters  for  shocks. 

3(a):  With  TVD  limiter 
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ABSTRACT 

Wind  tunnel  testing  of  Army  temper  tents,  used  in  the  Chemically  Protected 
Deployable  Medical  System  (CP  DEPMEDS),  is  simulated  using  a  two-dimensional, 
staggered  grid,  finite  difference  approach.  This  numerical  wind  tunnel  approach  is  used  to 
predict  pressure  data  to  determine  lift  and  drag  characteristics  of  the  tents.  Wind  tunnel 
blockage  due  to  the  tents  is  less  than  10%  and  wind  speeds  range  from  S  to  70  mph 
(4.6  X  10*  ^  Re  ^  6,4  X  1(P).  Length  to  height  ratios  for  the  tents  are  6.4  and  9.6,  for 
19.51  m  (64  ft)  and  29.26  m  (96  ft)  length  tents,  respectively.  Values  for  all  the  forces  are 
reported  for  the  19.51  m  (64  ft)  length  tent.  Although  not  reported,  the  values  for  the  29.26 
m  (96  ft)  tent  length  are  larger  by  a  factor  of  1.5.  Drag  and  lift  coefficient  values  are  area 
specific,  therefore,  these  values  are  valid  for  all  tent  lengths  where  the  two  dim^sional 
assumption  is  valid. 


ODUCTION 


U.  S.  Army  Belvoir  RD&E  Center  (BELVOIR)  is  continuing  to  support  U.  S.  Army 
Natick  RD&E  Center  (NATICK)  in  the  C-100  Air  Conditioner  program  for  the  Chemically 
Protected  Deployable  Medical  System  (CP  DEPMEDS).  As  a  part  of  that  work,  BELVOIR 
is  conducting  a  study  of  the  airflow  and  drag  characteristics  of  the  tents  to  aid  NATICK  in 
the  development  of  an  improved  tent  anchorage  system.  The  magnitude  of  the  forces  needed 
to  design  the  anchorage  system  are  calculated  by  multiplying  the  pressure  differences  across 
the  tent  by  the  surface  area.  Values  for  all  the  forces  reported  in  this  study  are  for  19.51  m 
(64  ft)  length  tents.  Corresponding  values  for  29.26  m  (96  ft)  length  tents  are  larger  by  a 
factor  of  1.5.  Drag  and  lift  coefficients  are  area  specific  and  hence,  are  valid  for  all  tent 
lengths  where  the  two  dimensional  assumption  is  valid.  This  study  includes  the  flow 
characteristics  of  a  single  tent.  Future  work  will  include  multiple  tents  positioned  downwind. 
The  tents  are  much  longer  in  length  than  in  width  or  height  and  for  CP  DEPMEDS  the  tents 
are  positioned  parallel  to  one  another  and  connected  at  one  end  by  a  narrow  passageway. 

'Supported  by  U.S.  Army  Natick  Research  Development  and  Engineering  Center. 
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BACKGROlINn 


A  study  of  wind  loads  on  the  Chemically  Protected  Deployable  Medical  System  (CP 
DEPMEDS)  tents  is  needed  by  U.S.  Army  Natick  Research  Eievelopment  and  Engineering 
Center  (NATICK)  to  assure  the  correct  development  of  improved  tent  anchorage  systems. 
Conventional  wind  tunnel  testing  is  an  expensive  and  time  consuming  method  to  obtain  wind 
load  data.  An  alternative  and  cost-cutting  approach  is  the  use  of  a  numerical  wind  tuimel, 
where  the  wind  tunnel  is  simulated  on  a  computer.  This  inexpensive  and  time-saving 
approach  is  swiftly  gaining  popularity  due  to  the  increasing  affordability  of  computers  and 
improved  computational  fluid  dynamics  codes. 

In  this  study,  a  numerical  wind  tunnel  is  used  to  obtain  predictions  of  velocities  and 
pressures  acting  on  CP  DEPMEDS  tents  (Army  temper  tents)  when  wind  speeds  of  5,  20, 

50,  and  70  mph  strike  the  upwind  side  of  the  tents  at  a  90  degree  angle  of  attack  (broadside). 

PROCEDURE 

The  DEPMEDS  tent  is  modeled  as  a  rigid  wall  building  with  a  peaked  roof.  In 
reality,  wind  will  deform  the  tent  wall  into  varying  shapes.  Also,  pressures  inside  and 
outside  of  the  tent  will  be  transient  when  the  tent  materials  flap  and  vibrate  due  to  the  wind. 
Predicting  these  complex  mechanisms  is  beyond  the  scope  of  this  study.  The  following 
procedures  were  used  to  obtain  values  for  engineering  calculations  leading  to  the  design  of  a 
tent  anchorage  system  and  are  not  meant  to  be  a  transient  simulation  of  the  complex 
mechanisms  involved.  The  geometrical  model  of  the  peaked  roof  tent  is  built  on  a  one  to 
one  scale,  using  small,  stair  step  blocks  to  build  the  inclined  roof.  As  it  turns  out,  the  stair 
step  shape  of  the  roof  does  not  have  a  major  effect  on  the  lift  and  drag  forces  on  the  tent 
because  most  the  flow  over  the  upwind  and  downwind  roof  section  is  separated  from  the  roof 
surface.  Hence  the  core  flow  makes  little  contact  with  the  roof  surface.  The  height  of  the 
vertical  tent  walls  is  1.98  m  and  the  lieight  of  the  peaked  roof  is  1.22  m  making  the  total  tent 
height  from  ground  to  peak  3.2  m.  Wind  tunnel  height  is  55  m  and  the  wind  tunnel  blockage 
is  6%,  within  the  recommended  value  of  10%  or  less  (ASHRAE,  1989).  Upwind  of  the  tent 
is  29.3  m  of  level  ground.  Downwind  of  the  tent  is  39  m  of  level  ground. 

Tent  width  is  assumed  a  constant  value  of  6. 1  m.  This  constant  value  will  produce 
errors  at  low  wind  spee;ds  due  to  the  slightly  rounded,  upwind  wall  of  the  pressurized  tents. 
The  largest  error  will  he  produced  by  the  sharp  edge  of  the  upwind  wall  eves  in  the  model 
geometry.  However,  at  high  wind  speeds,  the  wind  will  diminish  the  roundness  of  the  actual 
wall  because  the  outside  pressure  is  equal  to  or  exceeds  the  inside  pressure.  Roundness  of 
the  walls  is  shown  in  Figures  1  and  77  of  the  photograph  section  of  the  final  report  of  the 
Customer  Engineering  Design  Test  (Phase  II)  at  Fort  Indiantown  Gap  (Cho  and  Bryant, 

1993).  At  wind  speeds  above  16  m/s  (36  mph),  the  upwind  wall  is  expected  to  form  a 
concave  shape  that  increases  with  windspeed.  The  effect  of  this  concavity  on  drag  and  lift 
will  be  included  in  future  work. 
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Uniform  velocity  profiles  of  magnitudes  of  2.24  m/s  (5  mph),  8.94  m/s  (20  mph), 
22.4  m/s  (SO  mph),  and  31.3  m/s  (70  mfA)  are  prescribed  at  the  wind  tunnel  inlet.  The  flow 
is  assumed  to  be  incompressible  because  the  largest  Mach  number  is  0.09,  which  is  much 
smaller  than  Ma=0.3  when  compressibility  becomes  significant.  Also,  laminar  flow 
simplifications  are  used  to  include  some  viscous  effects  without  the  penalty  of  large 
computational  times  associated  with  turbulent  solutions.  Turbulent  solutions  may  be  used  for 
more  detailed,  future  studies  if  NATICK  desires  these  results.  Turbulent  solutions  typically 
double  computational  time.  The  inclusion  of  viscous  forces  allows  the  recirculation  r^ion, 
located  at  the  bottom,  upwind  side  of  the  upwind  tent,  to  be  modeled  (Figure  1).  Viscous 
forces  reduce  the  approaching  wind  speed  near  the  ground,  produce  the  recirculation  region, 
and  determine  the  height  of  the  stagnation  line.  Figure  2  shows  the  inviscid  solution,  with 
the  absence  of  the  recirculation  region,  as  compared  to  the  viscous  solution  in  Figure  1. 

Two  dimensional,  steady  state,  conservation  equations  in  Cartesian  coordinates  are  presented 
below.  The  conservation  of  mass  is  expressed  as 


d(pu)  ^  d(pv)  _Q 
dx  dy 


[1] 


where  terms  are  defined  in  the  Nomenclature  section.  The  constant  density  terms  are  treated 
as  variables  due  to  the  general  nature  of  the  code  CFD2000  1.0,  a  version  of  PHOENICS 
1.5,  which  is  chosen  for  this  study  (Phoenics,  1988).  The  following  equations  for  the 
conservation  of  momentum  in  the  x  and  y  directions  are  linearized  and  solved. 


d(puu)  ^  d(pvu) 
dx  dy 


dx  dx^^ 


LAM 


du  \  d  / 


du. 

dy 


[2] 


djpuv)  ^  3(pvv) 
dx  dy 


dy  dx 


dx  dy 


LAM 


dy 


[3] 


To  link  pressure  and  flow,  CFD2000  uses  Semi-Implicit-Method  for  Eressure-Linked- 
Equations  (SIMPLE),  developed  by  Patankar  (1980),  in  conjunction  with  the  staggered  grid 
method.  The  staggered  grid  method  is  used  to  prevent  oscillations  of  the  pressure  field.  As 
a  result,  pressures  are  calculated  at  cell  nodes  and  velocities  are  calculated  at  cell  faces. 

Horizontal  grid  resolution  is  1.63  m/cell  in  region  1,  0.24  m/cell  in  region  2,  and 
1.63  m/cell  in  region  3  (Figure  3).  Vertical  grid  resolution  is  0.25  m/cell  in  region  A,  0.17 
m/cell  in  region  B,  and  2.5  m/cell  in  region  C  (Figure  3).  The  discretized  forms  of 
equations  1,  2,  and  3  are  solved  at  4,234  cell  nodes,  totalling  12,702  simultaneous  equations. 
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ON  ARMY  TEMPER  TENT. 
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FIGURE  2.  INVISCID  SOLUTION  RESULTING  FROM  EyLER’S  EQUATIONS. 
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FIGURE  3.  REGIONS  OF  DISCRETIZATION. 


REsmjs 


Solutions  converge  within  2500  to  3000  iterations  and  graphical  examples  of  the 
results  are  in  Figures  4  and  5.  Aj^roximately  three  hours  of  computer  run  time  is  required 
for  convergence  using  a  486  personal  computer  operating  at  33  MHZ.  The  32  bit  Lahey 
F77L-EM/32  FORTRAN  Compiler  version  4.02  with  the  Lahey  Ergo  OS/386  DOS  Extrader 
is  used. 

Numerical  wind  tunnel  results  are  interpreted  using  graphs  that  show  velocity  vectors 
and  pressure  contours.  The  direction  of  the  velocity  vectors  is  indicated  by  the  angle  of  the 
arrows  and  the  magnitude  of  the  vector,  in  m/s,  is  indicated  by  the  length  of  the  arrow. 
Pressure  magnitudes,  in  pascals,  are  indicated  as  varying  shades  of  black  where  the  pressure 
increases  as  the  shade  lightens.  Values  assigned  to  the  shaded  contours  are  printed  in  the 
legend  located  on  the  right  side  of  the  pressure  contour  plot.  An  example  of  how  to  interpret 
the  pressure  contour  predictions  is  to  note  the  light  shade  of  the  contour  on  the  lower  upwind 
side  of  the  tent  and  then  note  the  darker  shade  contour  on  the  lower  downwind  side.  Noting 
the  pressure  shaded  legend  on  the  right  side  of  the  page,  the  pressure  values  may  be  found; 
and  the  difference  between  the  two  values  is  the  maximum  pressure  drop  across  the  tent. 

The  individual  pressures  are  relative,  not  absolute,  due  to  the  dP/dx  and  dP/dy  terms  in  the  x 
and  y  momentum  equations.  Hence,  the  difference  between  two  relative  pressures  yields  the 
correct  pressure  difference. 

Figures  4  and  5  are  representative  of  the  flow  patterns  and  pressure  contours  over  the 
computational  domain.  Three  main  flow  regimes  exist  and  these  include  separated  flow, 
recirculation,  and  reattachment  (Figures  4,  5,  6  and  7).  Flow  separates  near  the  peak  of  the 
roof  and  recirculates  on  the  downwind  side  of  the  roof  and  tent  wall.  The  separated  flow 
reattaches  to  the  ground  downwind,  and  these  distances  and  ratios  (reattachment  length/tent 
height)(L/H)  are  given  in  Table  1.  Objects  located  in  the  separation  region  are  shielded  fiom 
wind  loads  and  are  less  likely  to  be  blown  around  or  damaged  during  high  wind  gusts. 

Height  of  the  separation  region  decreases  downwind;  therefore,  the  tallest  objects  to  be 
shielded  should  be  positioned  closest  to  the  downwind  wall. 
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Drag  and  lift  forces  and  the  corresponding  coefficients  are  given  in  Table  1  and  are 
important  for  tent  anchorage.  The  drag  and  lift  coefficients  versus  wind  speed  and  Reynolds 
Number  are  a  constant  value  (Table  1)  and  are  defined  as 


Drag 

l/2pV^A 


[4] 


,  _  Lift 
^  '  l/2pV^A 


[5] 
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FIGURE  7.  MAGNIFIED  VIEW  OF  SEPARATION  AND  RECIRCULATION  REGIONS 


Hence,  drag  and  lift  may  be  calculated  for  any  wind  speed  between  5  and  70  mph.  As 
equations  4  and  S  and  Figure  8  show,  drag  ai^  lift  increase  quadratically  with  respect  to 
wind  speed.  Drag  is  calculated  by  summing  the  products  of  the  pressure  differences,  across 
the  tent  width  by  the  cell  areas  of  the  upwind  wall  and  the  vertical  projection  of  the  roof. 
Drag  is  the  horizontal  force  that  the  tent  anchorage  system  must  support. 

Table  1.  Drag,  Lift  and  Reattachment  Parameters  as  a  function  of  Wind  Speed  and 
Reynolds  Number  (based  on  tent  height). 


Wind  Speed 
km/h  (mph) 

8.0  (5) 

32.2  (20) 

80.5  (50) 

112.6(70) 

Reynolds  No. 

4.56x10* 

1.82x10® 

4.56x10® 

6.39x10® 

Drag  Coeff 

0.78 

0.77 

0.77 

0.77 

Lift  Coeff 

0.45 

0.45 

0.44 

0.44 

Drag  N  (Ibf) 

140  (30) 

2260  (510) 

14140  (3180) 

27580  (6200) 

mamm 

2500  (560) 

15470  (3480) 

30110  (6770) 

Reattachment 
Distance  m  (ft) 

’>6.2  (119) 

36.5  (120) 

37.1  (122) 

37.4  (123) 

Ratio  L/H 

11.3 

11.4 

11.6 

-  . 

Lift  is  calculated  by  summing  the  products  of  the  pressure  differences  across  the 
height  of  the  tent  by  the  cell  areas  of  the  horizontal  projection  of  the  roof.  The  pressure  at 
the  bottom  of  the  tent  is  assumed  to  be  at  atmospheric  pressure.  Flow  separation  at  the 
leading  edge  of  the  roof  (upwind  eves)  produces  a  low  pressure  region  that  tends  to  lift  the 
upwind  portion  of  the  tent  (Figure  7).  However,  the  prominent  flow  separation  region  exists 
over  the  downwind  roof  section,  and  it  produces  a  low  pressure  region  that  tends  to  lift  the 
downwind  portion  of  the  tent.  Lift  is  the  vertical  force  that  the  anchorage  system  must 
support. 

Forces  Tending  to  Cause  Roof  and  Wall  Blowout 

During  the  Customer  Engineering  Design  Test  (Phase  II)  that  was  conducted  during 
the  summer  of  1992,  the  U.S  Army  Combat  Systems  Test  Activity  (CSTA)  reported  several 
occurrences  of  wall  blowout  during  pressurization  (Cho,  1^92).  Wall  blowouts  were 
prevented  by  insuring  that  the  overpressure  value  did  not  exceed  0.6  inches  of  water  (IW). 

In  a  no  wind  condition,  the  CP  DEPMEDS  recommended  overpressure  of  0.6  IW  produces  a 
5770  N  (1300  IbO  force  acting  perpendicular  to  each  1.98m  x  19.51m  upwind  and  downwind 
wall.  The  pressure  difference  across  all  the  walls  and  the  roof  are  equal.  However,  under 
wind  load  conditions,  the  pressure  difference  across  e  tent  walls  and  roof  varies 
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around  the  tent,  with  the  minimum  value  existing  on  the  upwind  wall  and  the  maximum 
value  existing  on  the  downwind  wall.  The  maximum  pressure  difference  occurs  mi  the 
downwind  side  because  a  relatively  high  pressure  exists  in  the  tent  and  a  relatively  low 
pressure  exists  outside  the  tent,  llie  difference  between  the  two  pressures  is  relatively  large. 
Resultant  forces  on  the  downwind  wall  due  to  tent  pressurization  and  wind  loads  are  given  in 
Table  2  and  Figure  8.  The  downwind  wall  is  the  section  most  vulnerable  to  blowout. 

The  downwind  roof  section  is  the  next  most  vulnerable  section  to  blowout.  In  a  no 
wind  condition,  the  overpressure  produces  a  9575  N  (2150  Ibf)  force  acting  perpendicular  to 
each  roof  section.  Under  wind  load  conditions,  the  projection  of  the  lift  forces  perpendicular 
to  the  roof  plus  the  overpressure  force  of  9575  N  combine  to  form  a  larger  force  that  tends 
to  blowout  the  downwind  roof  section  (Table  1  and  Figure  8).  Resultant  forces  on  the 
downwind  roof  section  are  given  in  Table  2  and  Figure  8. 

Table  2.  Forces  Normal  to  the  Downwind  Roof  and  Wall  Sections. 


Wind  Speed  (mph) 

Force  Normal  to  Downwind 
Roof  Section  N  (Ibf) 

Force  Normal  to 

Downwind  Wall  N  (Ibf) 

0 

9575  (2150) 

5770  (1300) 

10100  (2270) 

5830  (1310) 

i  20 

1 14o0  (2580) 

6490  (1460) 

50 

19024  (4280) 

10150  (2280) 

70 

27550  (6190) 

14280  (3210) 

CONCLUSIONS 


Drag  and  lift  forces  are  predicted  for  5,  20,  50  and  70  mph  wind  speeds.  The 
constant  values  for  the  drag  and  lift  coefficients  allow  drag  and  lift  to  be  calculated  for  any 
wind  speed  between  5  and  70  mph  and  for  any  length  tent  where  the  two  dimensional 
assumption  is  valid.  Also,  Figure  8  may  be  used  to  predict  the  forces  tending  to  blow  out 
the  downwind  roof  and  wall  sections.  These  forces  will  aid  in  the  development  of  stronger 
seams  for  the  NBC  tent  liner.  Forces  are  recorded  to  provide  data  for  development  of 
improved  tent  anchorage  systems. 


FUTURE  WORK 


BELVOIR  is  presently  predicting  air  flow  and  drag  characteristics  of  multiple  CP 
DEPMEDS  tents  positioned  downwind  (Figures  9  and  10).  Introduction  of  a  second  tent 
doubles  computational  time,  hence,  Euler  equations  for  inviscid  flow  are  used  to  reduce 
computational  time. 
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FIGURE  9.  VELOCITY  VECTORS  OF  TWO  TENTS  IN  PARALLEL. 
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Prdiminary  results  indicate  that  the  outer  two  tents  of  the  CP  DEPMEDS  complex 
block  the  air  from  the  inner  tents.  The  requirement  for  expensive,  heavy  duty,  anchorage 
equipment  may  be  limited  to  the  outer  tents.  The  iimer  tents  can  be  fitted  with  less 
expensive,  lighter,  anchorage  equipment.  This  helps  to  reduce  cost  and  weight  of  the 
anchorage  system. 

Also,  the  slower  air  flowing  between  tents  may  function  like  a  pneumatic  separator  in 
sandy,  desert  regions  where  dust  and  sand  storms  are  common.  The  particles  being 
transported  in  the  separated  regime  could  settle  between  tents  as  they  contact  the  slower 
recirculation  regimes  between  tents  (Figure  9).  This  could  produce  sand,  dust,  or  snow 
drifts. 

Finally,  we  expect  the  high  pressure  that  exists  on  the  upwind  wall  of  the  upwind  tent 
and  the  low  pressure  that  exists  between  tents  to  combine  and  defeat  the  overpressure  system 
for  wind  speeds  above  16  m/s  (36  mph).  This  will  cause  contaminants  to  enter  the  upwind 
tent  and  quickly  flow,  by  convective  transport,  into  each  of  the  downwind  tents.  This  will 
happen  because  the  downwind  tents  are  engulfed  in  the  low  pressure  region  shown  in  the 
shaded  area  in  Figure  10.  Convective  transport  is  expected  to  spread  the  contaminants  from 
the  upwind  tent  to  the  downwind  tents  so  quickly  that  medical  personnel  will  not  have  time 
to  isolate  the  downwind  tents. 
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NOMENCLATURE 


p  »  air  density 


V  »  air  velocity 


X,  y,  r  =  cartesian  coordinates 


u,  V,  w  -  velocity  components 


d  ~  differential  operator 


P  =  pressure 


=  laminar  viscosity 


A  =  projected  area 
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IHREE-DIMENSIOIML  RNTIB  EUEMEKra  WnH  MULTIPLE^jUAI^ 


WiDf  Kam  Lki.  Ytt-Kan  Hn  ud  Ted  Bdytaddoo 

Departmeitt  of  Mechanical  Engiiieeriiig 
Nbctbwesicni  UniveisiQr 
Evanston,  IL  60208 

ABSTRACT.  New  miil^le-qua<bature-point  underintegrated  finite  elenientt  with 
hott^ilattcoiMnd  aie  developed.  The  elements  aie  onderiniegrated  to  avoid  vtdunelric  and  shear 
lod^  and  save  computational  tune.  An  ^>proach  for  hourglass  ctmtrol  is  proposed  sudi  that  the 
stahiliaBinn  operators  aie  obtained  siniply  by  taking  the  partial  derivatives  or  die  generalised  strain 
rate  vector  with  respect  to  the  natural  coordinates  so  tnat  the  elements  require  no  stabilization 
parameter.  To  improve  accuncy  over  the  traditional  one-point-quadrature  elements,  aevenl 
quadrature  p<^ts  are  used  to  integrate  the  internal  fences,  e^ecially  for  uacin||  the  plastic  fronts 
in  the  mesh  during  loading  and  unloading  in  ela^-plastic  analysis.  Four-pc^t  quadrature  are 
pn^ioaed  for  use  in  the  two  and  three  dimensional  elements.  Other  multiple  quadrature  pmnts  can 
also  be  employed.  Several  numerical  examp^  such  as  thin  beam,  plam  and  shell  pnAilems  are 
presented  to  draonstrale  the  i^lical^ty  of  the  pn^xised  elements. 

INTRCX^UCnCH^J.  In  large  scale  finite  element  analys^  thousands  of  elements  and  la^ 
computer  memory  demands  are  requiied  to  obtain  the  detailed  information  for  engineeiiim  design 
or  process  control  In  these  analyses,  computadtxud  costs  are  mosUy  determined  w  the  miciency 
the  elements,  especially  for  nonlinear  problems.  Perhaps,  the  most  efficient  dements  are  the 
one-point-qttadfature  elenients  with  hotm^ass  cootrd  devdoped  by  Flanaipm  and  Bdyttdiko  [1], 
Bdytsdiko  [2]  and  Bd^hko  et  oL  [3].  The  mesh  instability  associated  with  the  under-integniied 
elements  is  controlled  by  adding  a  stabilization  to  the  one-point  quadrature  element  The 
stabilization  terms  are  obtiuned  by  ensuring  the  consistency  of  the  finite  clement  equations  and  its 
magnitude  is  controlled  by  a  user  control!^  stabilizatitm  parameter.  Liu  and  Bdytsdiko  [4]  also 
develop  a  one-point-quadratrure  element  for  bed  conduction  problems.  In  this  w(^  the 
stabilization  paruneter  u  determined  by  solving  an  eigenvalue  proU^  The  rdationships  between 
the  stabilizaaon  parameto'  and  finite  difference  formulae  and  full  integr^on  finite  elements  are 
discussed  in  the  same  paper,  hi  Bdytschko  etaL  [5],  the  Hu-Washizu  variational  pcinci|de  is  used 
to  examine  the  rai^tude  d  the  stabilization  parameters.  More  recently,  an  assumed  strain 
stabilization  of  the  rour-node  quadrilateral  elemrat  and  the  dght-node  hexdiedral  ekmmit  with 
(me-point-qaadratore,  where  die  stabilization  parameters  are  not  required,  was  proposed  by 
Belytsdrio)  md  Kndeman  [6,71. 

An  ahemative  apiwoach  for  hourglass  control  is  proposed  by  Liu  et  al.  [8],  in  which 
resulting  stabilization  matrix  requires  no  stabilization  panuneter.  It  is  shown  that  ^  stabilizatitm 

vector  yean  be  obtained  simply  by  taking  the  partial  derivatives  of  the  generalized  strain  vecuv 
with  respect  to  the  natural  coordinates.  The  strain  vector  is  therefore  approximated  by  the 
combination  of  a  constant  part  and  other  parts  involving  strain  derivatives.  However,  ^ear- 
related  locking  phenomena  are  not  taken  into  consideration  wd  no  three  dimensional  result  is 
rqxirted  in  thew  study.  Anodm’  technique  is  the  so-called  directional  reduced  integration  propo^ 
by  k(di  and  Kflcuchi  [9]  based  on  the  procedure  given  in  Liu  et  al.  [8].  fo  cmitrast  to  selective 
r^uced  integration,  where  certain  parts  of  internal  virtual  work  are  underintegrated  unifmmly  in 
all  directkms,  die  directiomd  reduced  integration  undnintegrates  in  certain  dire^ons.  Numerical 


This  reseaidi  is  sponsor^  by  an  ARO  Grant  number  DAAL03-91-G-0016  and  Narkmal 
Center  for  Superctmiputing  Applications  at  Uibana-Champaign. 
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nrimiilfit  ihow  dm  tkit  techiiiqiie  is  effiKdve  for  two  duamskmal  proUems.  The  suUkxs  also 
extend  te  saoM  techoiqm  10  demlop  dim  dunensioQal  plaie/sheU  elements;  however,  hourglass 
modes  are  found  in  those  demems. 

In  this  paper,  two  and  three  dimensional  underint^rmed  elements  baaed  on  procedures 
suniltf  to  that  proposed  1^  Liu  et  dL  (8]  are  developed.  Tm  onphaais  is  placed  on  die  avoiding 
of  locking  and  the  removal  of  spurious  sin|ular  modes.  These  elements  are  applicable  to  beam, 
pfane  and  shell  bending  proMems,  and  more  m^xxtamly  they  are  suitaMe  to  anal^  mend  fonning 
processes. 

HGHT-NQDE  HEXAHEDRAI.  EI  FMENT  Let  US  consider  an  eight-node  hexahedral 
element  as  shown  in  Hgure  1.  The  spatial  coordinates  Xi  and  the  velocity  components  Vi  in  the 
element  are  approximated  by  linear  combinations  of  nodal  values  Xia  and  vu,  and  shape  functions 

Ht  C)  **  fdlows: 

NEN«8 

-I  (1) 


rsn 

Vi*  X 

-»  (2) 


N.(tii.o=^i+«)(i+n.»iKi+«)  .j 

where  the  subscripts  "i"  and  "a"  dexiote  coordinate  component  ranging  from  one  to  three  and  the 
element  node  number,  ranging  from  one  to  eight,  respectively.  The  referential  coordinates 

4,  111  and  C,  of  node  a  are  denoted  by  4a,  11a  and  Ca,  ie^)ectively. 

For  the  purpose  of  identifying  die  deformation  mod^  of  the  element,  let  us  define  the 
gradient  submatiioes  SaQ2)  and  otlm  column  rectors  as: 


r  N„(fi)  1 

l>u 

&(Q)* 

= 

b2a 

.  b3.  . 

i‘  =  [l,  1,1,1, 1, 1. 1, 1] 


in  “  [^1*  1^3,  X4f  X5,  X7,  X^J 


(6) 


y*  =  [yi.  y2.  ys.  y4,  ys.  ys.  y?.  ysl 


(7) 


gl  *  [Xj,  Z2,  Z3,  Z4,  Z5,  Zg,  Z7,  Zg] 


(8) 


ll\*[l..l.  1,-1.  1.-1,  1,-1] 


(9) 


lji  =  [l.-l.-l.  l.-l,  1,  1,-1] 


(10) 
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1,-1,>1,1,-1, 1. 1] 

(H) 

bi«[-i,  1,-1, 1, 1.-1, 1,-ij 

(12) 

|‘«H.  1.  1.1. -n 

(13) 

1,  l,.l,.l.  1, 1) 

(14) 

C*»[-1..1,.1..1, 1,1,1,  IJ 

(15) 

wliere  X.  andzare  the  nodal  coordinates  and  bi  is  the  ^'hourglass  vector,  I12  the 

hourglass  vector,  bs  the  T)([-hourglaas  sector  and  Ju  the  ^(^tourglass  vector. 

The  Jacobian  matrix  evaluated  at  the  center  of  an  element  can  be  shown  to  be: 


T,*! 


i,j»lA3 


Ci  ;y  Ci 


The  deienninant  of  the  Jacobian  matrix  is  denoted  by  Jo  and  the  inverse  matrix  of  XCQ)  ta  given  by 

& 

D*IDijl*i‘‘(Q)  (17) 

The  gradient  vectors  bi .  122  and  123  (which  are  evaluated  at (Q)  in  equation  (4).  can  be  shown 


bi  «  (bu)  *  ^Du|+Di2TI+Di3Q 

122  *  {bjt}  *^D2i|+D22n+D23CJ 

123  *  {bsi}  «  ^D31^+D32T|+D33Q 


The  strain  rate  i  is  approximated  by  expanding  it  in  a  Taylor  series  about  the  element 
center  up  to  bilinear  terms: 

Ti.  C) = 4(Q)+i4(fi)^+ii,(Q>n+i;(Q)C+ 

2g.4T,(Q)^ti+2i,,5(Q)n(;+2fi,5^(Q)C4  on 
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or 


fi  “  B»(^»  OXa 

a«l 


(22) 


where 


0  =  B.(Q)+B..t(Q)^+Ba.t,(Q)n+B..;((2)C+ 

2B.,4n(fi)^il+2B^,,;(Q)ilC+2B.,tt(Q)tt 


(23) 


The  first  term  of  the  right  hand  side  of  Equation  (21)  is  the  amstaat  strain  rates  evalutfed  at  the 
quadrature  point,  and  the  remaining  terms  are  linear  and  bilinear  strain  rate  terms.  After  some 
tedious  alg^ra,  it  can  be  shown  that  the  first  and  second  derivatives  of  Ba(j2)  with  respect  to 
natural  coontinates  are  given  by: 


=  ^Di2yi+Di3Y2l 

(24) 

II24  "  =^D22Yi +023^2) 

(25) 

Jtb.4  =  (Na.!^)  *^D32Yl+D33Tfel 

(26) 

bi.n  *  (N*,xn}  =^DuYi+Di3p] 

(27) 

b2.n  =  {Na,yT»}  =  |{D2lYl+D23]^l 

(28) 

b.n  =  {N.,!,,}  =^D3iYi+D33T^l 

(29) 

bl.;  =  (N„;}  =:^DuY2+Di2]&1 

(30) 

b2.;  =  =  ^D2iY2+D22Yj1 

(31) 

bs.;  =  (Na.z;}  =  ^D3lY2+D32Y3l 

(32) 

bi4n  =  {N8.x5h)  =^Di3Y4-(£\si)bi4-(i:*iXi)bi.Ti] 

(33) 

b24n  =  {Na,y4,,}  =:^D23Y4-(£Wbi,rto)bi.nl 

(34) 

23? 


=  (Nm4h)  *|{D33]r4*(]^ll^-0ib)Ju,t,l 

(35) 

fci.nC  *  =^DiiY4.(qyy)Jbj.,.(£\jj)jjj^^] 

(36) 

li2.n;  *  <N«,ynC)  =^*^iY4-(qki)bi.v(Etei)bi.;l 

(37) 

ba.nc  =  =|{D3iY4-(4Si)Jli.n-(EUi)bi.(;l 

(38) 

lii.U  =  =  8tDi2Y4'(q‘iii)iii.4-(riji)lii.;l 

(39) 

bzu  =  =^D22Y4'(qki)lii.4'tf2Si)li|.;] 

(40) 

b3.^;  =  <  =  ^D32Y4-(qUi)bi,^-(l^Si)Jbi.(;l 

(41) 

where 

Pi  =  Dii]^l+Di3j[l3 

(42) 

qi  *  Diib^-i-Diihx 

(43) 

li  =  Di2lll+Di^ 

(44) 

The  Yo  in  equations  (24)-(41)  are  the  stabilization  vectors  which  span  the  improper  null- 
space  of  B(Q).  They  are  given  by 

Yo  =  ho  ■  (ha2i)hi 

(45) 

where  i  is  the  summation  index  form  1  to  3.  Yo  are  orthogonal  to  the  linear  displacement  field  and 

provide  the  proper  stabilization  for  the  element  It  is  also  noted  that  the  ^-stabilization  element 
always  nieets  the  patch  test  while  the  h-stabilization  element  does  not  Belytschko  and  coworkers 
[1-3]  derive  these  vectors  from  consistency  requirement 

To  alleviate  volumetric  locking,  we  employ  the  ideas  underlying  selective/teduced 

integration  (Hughes  [10]).  g,(^,  q,  0  is  decomposed  into  two  parts:  the  dilatational  part  and  the 
deviatoric  part  The  dilatational  part  of  gradient  matrices  are  underintegrated  and  evaluated  only  at 
one  quadrature  point  Q,  to  avoid  volumetric  locking: 

(46) 

Expanding  about  the  element  center,  equation  (46)  can  be  written  as: 

233 

(47) 


B.(t  n.  0 «  b.(qk  BU(fi)4-*-B«a»n«^B5(a)C+ 

2B5,(fi)4Ti+2BjJ^(e)nC+2B55(Q)C^ 

where  B«(Q)  are  the  one-point-quadrature  gradient  submatrices  contributed  from  both  the 
dilatadonal  and  deviatoric  parts.  The  remaining  terms  on  the  right  hand  side  of  the  above  equation 
are  the  gradient  sutoiatrices  corresponding  to  non-constant  deviatoric  strain  rates.  It  is  no^  that 
the  element  using  the  gradient  matrices  as  in  equation  (46)  or  (47)  is  properly  underintegrated  and 
exhibits  no  hourglass  mode  if  the  element  internal  energy  is  evaluamd  by  using  the  multipie-point 
quadrature. 

The  element  developed  so  far  is  not  suitable  to  plate/shell  analysis  owing  to  the  shear  and 
membrane  locking  in  thin  structures.  To  remove  shear  locking,  the  gradient  submatrices, 
corresponding  to  tte  assumed  shear  strain  rates  is  written  in  an  orthogonal  corotational  coordinate 
system  rotating  with  the  element  (see  Figure  2)  as: 


n.  0  =■  B«(B)+  BSia!)5+Bts:i,(Q)n+B**{(a)C+ 
2Bjr5,(!i*n+2Btr;,?(2)nc+2Btr„(B)« 

(48) 

iwft.  1.  c) = B„<fi)+  Bi;;:5(a)t+Bf?:,«!)n+B^;(e)C+ 
2Bj;:5,(Q«n+2Bj^,{(a)iii:+2B5;;«(e)« 

(49) 

It.  C) «  B=(fi)+  BS4(B)«+B£',(B)il+B2:{(l!)t+ 
2BSs,«!)4’l+2BS;^C+2BS'„(e)« 

(50) 

i.r(i  1. 0 = B«y(a)+Bt;;i;(!»(; 

(51) 

fyzft.  1.  C)  =  B,.(2)+B?r5(2)4 

(52) 

BxCi  It.  C)  -  B><<S)+B£r<i(Q>>t 

(53) 

where  Bxx>  6yy<  Bzz>  Bxy>  Byz  and  Bzx  are  the  gradient  submatrices  corresponding  to  strain  rates 

ixx«  £yy.  ixyt  iyz  and  gzx.  respectively.  Here,  only  one  non-constant  term  is  used  for  each 
shear  strain  rate  component  such  that  the  modes  causing  shear  locking  are  removed.  The  normal 

strain  rates  keep  all  non-constant  terms  given  in  equation  (47).  In  B‘**^(Q)t  only  those  terms 
corresponding  to  a  parallelpiped  element  are  used  for  stabilization. 

To  detect  plastic  fronts  in  the  mesh  during  loading  and  unloading  more  accurately  for 
elastic-plastic  large  deformation  problems,  we  propose  to  use  a  four-point-quadrature  scheme 
instead  of  the  one-point-quadrature.  The  element  internal  force  vector  is  evaluated  at  the  four 
integration  points  located  as  follows: 
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Poilitl:  (  +  ^-.  +  ^.  +  ^);  Point2:  + 

Point3:  (-;j|  .+;^.-^);  Point4:  (  +  ^  (54) 

This  element  exhibits  no  hourgUss  mode  and  is  rank  sufficient 

By  assuming  that  the  Jacobian  is  a  constant  one  quarter  of  die  elemmtt  volume,  the  rfwwejit 
inmnal  force  vector  can  be  inlegraied  as  follows: 


where  4k  doiotes  the  natural  coordinates  of  the  intention  point  k  and  V  is  the  element  volume. 
The  element  internal  force  vector  can  be  rearranged  in  the  form: 


where  Ct  and  are  the  internal  force  vectors  resulting  from  the  one-point  quadrature  and  the 
stabilization  procedure,  respectively.  They  are  given  by: 

t  U  (4k)fel +t  12(4k)Jb+'C|3(4k)li3 
^  ^ 

4 

=  X  A  ''2l(4k)fel+'C22(4k)tl2+t23(4k)b3 

k.l  4  ~  ~  ~ 

'C3l(4k)lil+t32(4k)fe2+X33(4k)fe3 


gl(4k)tn'(4k)+g4(4k)^I2(4k)+g5(4k)X3l(4k) 

4 

fitib  =  X  ^  g2(4k>t22'(4k)+g6(4k)^l2(4k)+g7(4k)X23(4k) 
g3(4k)l^  (^)+g8(4k)^23(^)+g9(4k)t3  l(4k) 

where  the  superscript,  dev,  denotes  the  deviatoric  part  of  the  stress  (t^*''=Tij-^kk5ij)  and  the  other 
quantities  are  given  by 

gi(4)  =  Di  i(t1Yi+CY2+2tiCy4) 


g2(4)  =  D22(4Yi+CY3+24CY4) 


g3(4)  =  D33(4Y2+T1Y3-»'24qY4) 
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84(0  *  D22CYs 

(62) 

85(0  =  D33IIY5 

(63) 

|6(p  =  DnCY2 

(64) 

87(0  =  D334Y2 

(65) 

88(0  =  Dii^Yi 

(66) 

|9(p  =  DiiTlYl 

(67) 

It  is  noted  that  the  elements  developed  above  can  not  pass  the  patch  test  because  with  oik- 
point  quadrature  the  element  internal  forces  are  not  prq)erly  evaluated  if  the  elements  are  sdmwed 
(Belyt^hko  and  Bindeman  [7]).  To  remedy  this  drawback,  ga(j2)  are  replaced  by  the  uniform 
gradient  matrices,  ga,  defmed  by  Belytschko  et  al.  [1,2]: 


E.(^.  n.  0  dV 


where  Ve  is  the  element  volume.  Similarly,  equation  (4)  is  modified  as 


1.(0)  = 


(68) 


(69) 


and  the  stabilization  vectors  are  redefined  as: 

Yo  =  Ik*  ■  (i!a£i)0i 

which  span  the  proper  null-space.  Since,  the  element  internal  force  vector  can  be  evaluated  exactly 
when  the  element  is  subjected  to  a  constant  strain  rate  field,  the  use  of  the  uniform  gradient 
matrices  g.  leads  a  new  four-quadrature-point  element,  the  NUHEXIN-4  element,  which  passes 
the  patch  test 

The  three  dimensional  ASQBI  (assumed  strain  quintessential  bending  incompressible) 
element  is  developed  by  Belytschko  and  Bindeman  [7].  The  gradient  matrix,  g,  is  given  by 
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6u(4«  'n»  C) 
Byy(4*  0 
6b(4'  C) 

Bxy(4>  'H*  C) 

Byi(^»  'H*  C) 

_  6zx(4>  'n»  C) 


B  a  fi' 
aBfi 

,  Q  aB  ^ 
”  EB  Q 
aBE 
-B  ftB. 


hl.x7l+h2.x?2+h.474  -Vhi.x7l-Vh4.,?4  -Vh2ji?2-Vh4.x7i 

'Vhi,x7l'Vh4.x^  hl.y7i+h3.y^+h4.y^  -Vh3j,73-Vh4.x% 

'Vh2.x72-Vh4.x?4  -Vh3.x?3-Vh4.x74  h2.x?2+h3.r?3+h4,z74 

h3.y?3  h2.xE  Q 

Q  h2.2E  hi.v7i 


h3.273 


hi.y?l 

hi.x?i 


where 


hi*^Ti  h2  =  ^C  h3=*TiC  h4  =  4ilC 


;  V  is  ihe  Poisson's  ratio;  v  = 


1-v' 


To  compare  the  NUIffiXIN-4  and  ASQBI  elements,  let  us  write  out  the  gradient  matrix 
corresponding  to  the  NUHEXIN-4  element  can  be  shown  to  be: 


6xx(^>  tl*  C) 

Byy(^»  tl,  Q 
t],  Q 

Bxy(4»  "n»  C) 
ByzC^*  "H*  0 
_  6zx(4«  11*  0  _ 


a  a  fi" 

&a  Q 

0  qB  ^ 

aa  Q 

qBB 

-Baa, 
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8^ 


|Dii(n7i+C%+2iiC?4) 

^ii(ii7i+C%+2nC?4) 

'^ii(Tifi+C72+2nc74) 

D22C^ 


^22(4?l+ffi+2^C?4) 

^22(4?1+C%+24C?4) 

^22(4?I+C3+2i5C74) 

Di,C?2 


^33(^Ti+Tni+24ll74) 

J  MM  M 

^33(^%+|A+2^^4) 

|D33(^^+T^+25tj74) 

a 


a 


D33^?2 


D33TI73 


0 

«N# 


(73) 


It  can  be  seen  from  Equations  (71)  and  (73)  that  both  gradient  matrices  have  the  same 
number  and  locations  of  zero  and  non-zero  components.  Both  matrices  including  all  the 
stabilization  vectors  to  suppress  the  hourglass  modes  and  the  resulting  stiffness  matrices  are  rank 
sufficient  It  is  interesting  to  point  out  that  those  components  corresponding  to  the  assumed  riiear 
strain  rates  have  the  same  forms  but  with  different  coefficients.  It  is  believ^  that  the  BelytscMco- 
Bindeman  element  is  computationally  more  involved  since  the  gradient  matrix  is  written  in  terms  of 

the  spatial  derivatives  of  and  ^qCi  however,  the  gradient  matrix  of  NUHEXIN-4 

element  is  written  in  terms  of  q  and  ^  explicitly  so  that  the  stiffness  matrix  can  be  easily  obtained 
through  numerical  integration. 


EXAMPLES.  In  this  section,  a  variety  of  problems  including  beams,  plates  and  shells  are 
studied  to  investigate  the  performance  of  the  proposed  NUHEXIN-4  element.  Besides  this 
element,  runs  are  also  made  by  using  another  element  called  NUHEX-4,  in  which  the  virmal  work 
corresponding  to  the  dilatation  part  is  evaluated  by  using  a  four-point-quadratuie  scheme  instead  of 
the  one-point-quadrature  used  in  the  NUHEXIN-4  element  Since  the  NUHEXIN-4  element  is 
mainly  propos^  to  be  used  in  sheet  me^  forming  analysis,  the  applicability  of  NUHEXIN-4 
element  to  problems  of  thin  structures  is  also  studied  by  solving  the  standard  test  problems 
including  the  twisted  beam,  pinched  cylinder,  Scordelis-Lo  roof  and  hemispherical  shell,  which  are 
proposed  by  MacNeal-Harder  [1 1]  and  Belytschko  etaL  [12]. 

Clamped  Twisted  Beam.  In  this  example,  a  cantilever  twisted  beam  subjected  to  a  unifcmn 
shear  force  (F=1.0)  at  the  free  end  is  analyzed.  The  problem  statement  is  shown  in  Figure  3.  The 
computed  displacement  the  free  end  w  is  compared  to  the  analytical  solution  in  Table  1.  Unlike  the 
curved  beam  problem,  the  NUHEX-4  element  performs  better. 

Simply  Supported  Plate.  In  this  example,  a  simply  supported  plate  subjected  to  a 
concentrated  load  at  the  center  is  analyzed.  The  problem  statement  is  shown  in  Figure  4.  Due  to 
symmetry,  only  one  quarter  of  the  plate  is  modell^.  Two  cases  are  studied  in  this  example:  regular 
mesh  and  irregular  mesh.  The  computed  central  displacement  w  is  compared  to  the  an^ytical 
solution  in  Table  2  and  3.  Both  elements  compare  well  with  the  analytical  solution  for  the  uniform 
mesh;  whereas  the  IWHEXIN-4  performs  much  better  for  the  irregular  mesh. 

Pinched  Cylinder.  Figure  5.  shows  the  pinched  cylinder  subjected  to  concentrated  loads. 
Two  cases  are  studied  in  this  example.  In  the  first  case,  it  is  assumed  that  the  both  ends  of  the 
cylinder  are  free.  In  the  second  case,  it  is  assumed  that  the  both  end  of  the  cylinder  are  covered 
with  rigid  diaphragms  so  that  only  the  displacement  in  the  axial  direction  is  allowed  at  the  ends. 
Due  to  symmetry,  only  one  quarter  of  the  cylinder  is  modelled.  The  computed  central 
displacements  are  compared  to  the  analytical  solution  in  Table  4  and  5.  The  NUHEXIN-4  element 
performs  much  better  than  the  NUHEX-4  element  for  the  pinched  cylinder  with  diaphragms  and 
both  elements  perform  approximately  the  same  for  that  without  diaphragm. 
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Scordeli«-Lo  Roof.  Hgure6.  shows  the  Scordelis-Lo  roof  subjected  to  its  own  wdghL 
Due  to  symmetry,  only  one  quarter  of  the  roof  is  modelled.  It  is  assumed  that  the  berth  ends  of  the 
roof  are  covered  with  rigid  diaphragms  so  that  only  the  displacement  in  the  axial  direction  is 
allowed  at  the  ends.  The  computed  central  edge  di^lacement  w  is  compared  to  the  analytical 
solution  in  Table  6.  As  can  be  seen,  the  NUHEX-4  p^otms  better. 

Hemispherical  ShelL  Figure?  shows  the  hemisfrtiefe  shell  subjected  to  antisy^etrical 
concentrated  loads  at  its  bottom  ends.  Due  to  symmetry,  only  one  quarter  of  the  hmnispherical 
shell  is  modelled.  The  computed  radial  deflection  w  is  compared  to  the  analytical  solution  in  Table 

7.  Due  to  the  tknible  curvature  of  the  shell,  48  x  48  elements  are  required  to  modd  the  geometry 
correctly  widi  NUHEXIN-4  element 

CONCLUSIONS.  In  this  p^r,  the  multiple-quadrature-point  eight-node  hexahedral 
elements  are  developed.  The  emphasis  has  b^n  placed  on  the  removal  of  shear  and  volumetric 
locking  and  the  suppression  of  spurious  singular  modes.  The  stabilization  operations  are  obtained 
simply  by  taking  tte  partial  derivatives  of  the  generalized  strain  rate  vector  with  respect  to  the 
nan^  coordinates.  The  resulting  element  stiffness  matrices  can  be  explicitly  expressed  in  terms  of 
natural  cocurdinates  so  that  they  are  easier  to  compute  than  those  involving  spatial  derivatives.  The 
performances  of  the  propowd  elements  are  studi^  by  solving  beam,  phrte  and  shell  problems.  It  is 
demonstrated  that  the  solutions  are  satisfactory. 

Finally,  it  should  be  pointed  out  that  the  performances  of  the  NUHEX-4  and  NUHEXIN- 
4  elements  are  ^proximately  the  same  in  the  compressible  problems:  however  N(JHEXIN-4 
should  be  used  in  the  elastic-plastic  and  incompressible  problems  since  the  NUHEX-4  element 
suffers  from  volumetric  locking. 
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£  =  2.9x10^  v  =  0.22 

L=:12  H=l.l 

t  =  0.32  F  =  1.0 

Twist  =  90° 

Figure  3.  Problem  statement  of  the  clamped  twisted  beam 


Analytical  solution  w  =  0.005424 


Table.  1  Normalized  di^lacement  of  the  twisted  beam 
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-H  h- 


F,  w 


Regular  Mesh 


Irregular  Mesh 


E  =  3  X  10^  V  =  0.3 

L  =  20  t»0.2 

F  =  100.0 


Figure  4.  Problem  statement  the  simply  supported  plate 


Analvi 

tical  solution  w  =  0.021 138 

8x8x4 

16  X  16x4 

NUHEX-4 

0.943 

0.985 

0.987 

NtIHEXIN-4 

1.151 

1.034 

1.036 

Table.  2  Ncnmalized  displacement  of  the  singly  supported  plate  with  regular  meshes 


Analvi 

tical  solution  w  =  0.021 138 

KlHfl 

8x8x4 

16  X  16x4 

NUHEX-4 

0.611 

0.848 

0.939 

NUHEXIN-4 

0.704 

0.867 

0.984 

Table.  3  Normalized  displacement  of  the  simple  supported  plate  with  irregular  meshes 
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First  Case  (with  du4>hfagms) 

E=  1.05x10^  v  =  0.3125 
L=  10.35  t  =  0.094 
F  =  100.0  R  s  4.953 


Second  Case  (without  diaphragm) 

E  *  3  X  10^  V  *  0.3 
L  =  600.0  t:s3.0 

F  s  1.0  R  *  300.0 


Hguie  5.  Problem  staement  of  the  pinched  cylindo* 


Analytical  solution  w  s  0.1 1 37 


10x10x2 

15  X  15x4 

NUHEX-4 

0.927 

0.998 

1.005 

NUHEXIN-4 

1.145 

1.050 

1.055 

Table.  4  Normalized  dis{daoement  of  the  pinched  cylinder  without  diafriiragm 


TRRTWSnTit»H 

Liailnul 

10x10x2 

15  X  15x4 

20  x  20  x4 

NUHEX-4 

0.633 

0.870 

0.936 

NUHEXIN-4 

0.811 

0.934 

0.980 

Table.  5  Normalized  displacement  of  the  pinched  cylinder  with  du4>hragms 
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E  =  4.32x10*  v*0.0 

L  *  50.0  R  *  25.0 

t  s  0.25  Gravity  s  360.0/per  volume 

6  =  80® 

Figure  6.  Problem  statement  of  the  Scordelis-Lo  roof 


Analytical  solution  w  =  0.3024 


Elemem'"'-^ 

8x8x1 

16x16x1 

32x32x1 

NUHEX-4 

1.016 

1.011 

1.010 

NUHEXIN-4 

1.162 

1.144 

1.140 

Table.  6  Ntmnalized  displacement  of  the  Scordelis-Lo  roof 
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£  =  6.825x  10“^  v*0.3 

Radius  s  10.0  thickness  =  0.4 

Fs  1.0 


Figure  7.  PToblein  statement  of  the  hemi^herical  shell 
Analytical  solution  w  =  0.0924 _ 


16x1 

32x1 

48x1 

NUHEX-4 

0.615 

0.862 

0.954 

NUHEXIN-4 

0.638 

0.891 

0.984 

( ^elements  /side  x  elements/thickness ) 


Table.  7  Normalized  displacement  of  the  hemi^heiical  shell 
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ABSTRACT 

Light  combat  vehicles  are  playing  an  important  support  role  for  both  troops  and  other 
heavily  armored  combat  vehicles.  As  such,  they  have  a  much  greater  risk  than  in  previous 
roles  of  being  subjected  to  transient  loads  such  as  impact  and  overpressure  loads.  Propa¬ 
gation  of  ballistic  shock  from  an  impacted  region  to  the  critical  locations  and  attachment 
points  for  secondary  systems  can  cause  damage  and  misalignment  to  sensitive  equipments 
contributing  to  malfunction  and  reduction  of  vehicle  performance.  Accuracy  of  determina¬ 
tion  of  dynamic  response  of  these  vehicles  is  directly  dependent  on  the  degree  of  refinement  of 
the  generated  model  and  how  well  the  model  incorpcnraies  the  essential  features  of  the  vdbide 
and  correlates  to  its  important  characteristics  without  being  overburdened  by  non-essential 
details.  Additionally,  response  of  nonlinear  components  of  the  vehicle  in  high  frequency 
regime  may  influence  the  overall  global  response  of  the  vehicle.  As  a  result,  hatch  openings 
and  access  door  cutouts  with  unsymmetric  locations  may  have  to  be  incorporated  in  the  fi¬ 
nite  element  model  to  allow  fair  comparison  with  first  order  experiments  involving  a  stripped 
vehicle  hull.  The  current  study  is  an  attempt  to  assess  the  influence  of  multiple  rectangular 
cutouts  on  the  overall  transient  response  of  a  vehicle  hull  subjected  to  a  side-<Na  iir^iact  load. 


1.  INTRODUCTION 

The  analysis  of  the  dynamic  response  of  complex  systems  involving  structural  assem¬ 
blies  and  components  has  become  a  subject  of  considerable  research  because  of  its  practical 
significance  in  the  evaluation  of  structural  integrity  when  subjected  to  transient  loads  [1-4]. 
The  behavior  of  combat  vehicles  subjected  to  dynzunic  loads  is  of  particular  interest  to  the 
Army  because  of  the  need  to  ensure  survivabilty  and  minimize  degradation  of  performance 
in  both  primary  and  secondary  systems.  An  area  of  critical  concern  in  a  complex  system 
such  as  a  combat  vehicle  or  a  personnel  carrier  is  the  propagation  of  shock  from  an  impact 
point  on  the  vehicle  hull  to  the  location  of  the  driver  and  the  other  personnel  in  the  crew 
compartment  and  the  attachment  points  for  the  optical  equipments,  such  as  the  gunsights 
and  periscopes,  as  well  as  electronically  sensitive  command  and  control  devices.  The  fail- 
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ure  of  equipments  and  structures  in  such  a  system  resulting  from  shock  propagation  and 
vibrations  can  render  the  system  ineffective  and  vulnerable  to  enemy  attack,  leading  to  life 
threatening  situations.  Detailed  dynamic  response  analysis  of  such  systems  are  essential  in 
order  to  develop  nondestructive  evaluation  methods  to  ensure  survivability  of  critical  Army 
equipments  and  hardware. 

Armored  vehicles  must  survive  the  shock  resulting  from  non- penetrating  projectile  im¬ 
pact  and  be  able  to  retain  fighting  capability  after  this  impact.  Non-standard  shock  mea¬ 
surement  techniques  were  employed  by  W.  Scott  Walton  [5,6]  to  determine  ballistic  shock 
protection  requirements  for  armored  combat  vehicles.  Target  loading  histories  of  thin  plates 
were  developed  using  the  EPIC-2  Lagrangian  code  by  E.F.  Quigley  [7,8]  who  used  the  data 
as  input  to  the  ADINA  nonlinear  finite  element  analysis  code  to  predict  the  ballistic  shock 
environments  for  normal  impacted,  axisymmetric  targets. 

The  response  of  structures  at  critical  locations  is  directly  dependent  on  ballistic  shock 
propagation  from  the  point  of  load  application  to  the  point  of  attachment  of  secondary 
sensitive  equipments  when  subjected  to  high  frequency  loads,  such  as  projectile  impact  and 
close-in  blast  overpressure.  To  evaluate  the  post-impact  response  and  performance  of  the 
system,  both  frequency  domain  and  time  domain  response  of  the  system  should  be  conducted. 
However,  in  the  current  investigation  a  finite  element  approach  was  employed  to  obtain 
responses  at  specific  locations  of  the  model.  The  time  domain  responses  were  converted  to 
frequency  domain  responses  using  a  conversion  code  based  on  [9]. 

2.  PROBLEM  CONFIGURATION 

The  particular  vehicle  selected  for  this  simulation  is  the  Armoured  Personnel  Carrier 
(APC)  designated  as  APC  M113.  Overall  specifications  are  available  in  Jane’s  World  Ar¬ 
moured  Fighting  Vehicles  [10].  Detailed  dimensions  and  geometry  description  of  the  vehicle 
was  obtained  from  field  measurements  and  from  [11].  Overall  length,  width  and  height  of 
the  APC  are  4.863m,  2.686m  and  2.5m  respectively  while  the  height  of  the  hull  top  is  only 
1.828m.  Unloaded  weight  of  the  vehicle  is  9702  kg  while  the  fully  loaded  vehicle  weighs 
11156  kg.  Ground  clearance  for  the  loaded  vehicle  is  0.406m.  Details  of  the  track  assembly 
were  completely  left  out  since  the  hull  was  modeled  without  tracks  to  avoid  complexities  for 
the  3D  model  generation.  However,  subsequent  study  of  dynamic  response  due  to  transient 
loads  included  artificial  boundary  conditions  imposed  on  the  corner  nodes  of  the  bottom 
floor  to  avoid  large  scale  sliding,  rotation  and  overturning.  A  pictorial  view  of  the  entire 
vehicle  aissembly  is  shown  in  Figure  1. 

The  critical  locations  for  the  driver’s  and  the  commainder’s  seats  as  well  as  the  attach¬ 
ment  points  for  shock  sensitive  control  panels  and  periscopes  were  measured  from  an  actual 
vehicle  at  the  Combat  Support  and  Test  Activity  (CSTA)  at  the  Aberdeen  Proving  Ground 
and  the  corresponding  nodal  locations  on  the  vehicle  hull  were  determined  for  the  purpose 
of  response  prediction  since  the  simplified  model  did  not  include  any  internal  components. 
Additionally,  the  driver’s  hatch,  cargo  hatch  as  well  as  the  hatch  opening  for  the  comman¬ 
der’s  cupola  were  included  as  simulated  rectangular  cutouts  in  the  generic  model  on  the  top 
hull  surface  in  Figure  2  to  assess  the  influence  of  multiple  cutouts  on  the  overall  response  of 
the  model  as  a  first  step  towards  simulation  of  a  basic  vehicle  hull  without  any  tight  fitting 
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components  prior  to  Analysis  of  a  fuU-up  vehicle.  Although  the  actual  openings  in  the  vehicle 
hull  were  circular  shaped  exc^t  the  cargo  hatch  and  the  rear  door,  it  was  decided  to  use 
approximate  rectangular  cutouts  for  all  openings  because  these  could  be  quickly  generated 
through  elimination  of  a  cluster  of  rectangular  elements  in  appropriate  locations. 


3.  LOAD  ESTIMATION 

The  modeshapes  and  eigenfrequencies  analyses  of  the  hull  structure  in  ADINA  [12] 
do  not  require  transient  loading  of  the  vehicle.  However,  simulation  of  forced  response  of 
the  vehicle  require  imposition  of  impact  and  pressure  loads  on  one  side  of  the  vehicle  at  a 
specific  location  which  will  likely  result  in  shock  propagation  and  damage  to  primary  as  well 
as  critical  secondary  systems  inside  the  vehicle. 

Impact  load  due  to  side-on  impact  of  a  projectile  approximately  1.83  m  long  weighing 
approximately  6.8  kg  and  travelling  at  .914  km/s  is  calculated  <tssuming  that  the  rod  con¬ 
tinues  to  erode  at  a  constant  rate  determined  by  the  initial  velocity  of  the  rod  until  it  is 
fully  consumed  and  the  total  momentum  of  the  rod  is  imparted  as  impulse  to  the  side  of  the 
vehicle.  The  impact  load  is  imposed  as  a  concentrated  load  on  a  specific  location  at  the  side 
of  the  vehicle  and  it  is  given  as  a  step  function  with  a  constant  force  of  3382  KN  (760,000 
lb)  for  a  total  duration  of  .002  s. 


4.  FINITE  ELEMENT  MODEL  DESCRIPTION 

Prior  to  finite  element  model  generation  of  the  vehicle,  a  three-dimensional  model  of 
the  hull  was  developed  using  PATRAN  [13,  14).  The  first  step  in  model  generation  using 
PATRAN  is  the  development  of  an  initial  level  1  model  of  the  APC  which  includes  grids, 
lines  and  patches.  Grids  describe  points  on  the  model  2md  lines  represent  edges  while  patches 
describe  model  surfaces.  Hyperpatches  were  used  to  represent  solid  portions  of  the  three- 
dimensional  model.  Thus  a  simple  geometric  model  of  the  entire  hull  of  the  M113A  with 
access  openings  was  generated  upon  which  the  finite  element  model  would  be  based.  A 
geometric  description  of  the  vehicle  hull  is  given  in  Figure  2. 

Surface  normals  were  checked  for  each  group  of  elements  to  ensure  conformity  and  some 
rearranging  of  elements  was  found  to  be  necessary.  Since  the  transverse  bending  response  was 
considered  to  dominate  the  overall  response  problem  at  short  stand-off,  shell  elements  rather 
than  3-D  continuum  elements  were  used  to  represent  the  model.  Four-noded  shell  elements 
were  selected  to  model  the  entire  hull  assembly.  Each  rectangle  denotes  one  element  in  this 
figure.  Plots  of  nodes  and  element  numbers  generated  by  PATRAN  are  omitted  here  because 
of  overcrowding.  However,  location  of  impact  load  on  the  side  wall  as  well  as  some  critical 
node  locations  at  the  top  and  bottom  surfaces  corresponding  to  secondary  systems  inside 
the  vehicle  extrapolated  to  the  hull  model  are  indicated  on  this  figure  by  circles  as  shown. 

To  assess  the  influence  of  large  multiple  cutouts  on  the  overall  response  of  the  hull  model, 
it  was  decided  to  incorporate  simplified  cutout  shapes  only  on  the  top  hull  surface  in  the 
vicinity  of  the  critical  locations  for  the  secondary  equipment  attachment  points.  The  large 
ramp  door  at  the  rear  end  was  largely  ignored  for  the  first  order  model.  The  cutouts  were 
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approximated  as  square  or  rectangular  shaped  for  both  driver’s  and  commander’s  hatch  as 
well  as  crew  access  openings  to  allow  ease  of  modelling  and  inclusion  into  the  existing  basic 
Ml  13  hull  model.  Future  refined  models  could  be  generated  from  scratch  to  accomodate 
exact  shapes  and  location  of  openings  through  a  complicated  mapping  procedure  which  was 
avoided  during  this  study. 

The  finite  element  model  generated  by  PATRAN  consisted  of  an  assembly  of  31 1  ele¬ 
ments  with  322  nodes.  Thickness  of  each  shell  element  was  specified  to  be  equal  to  the  wall 
thickness  of  the  APC  hull  which  remained  constant.  For  stress  computation  each  element 
Wcis  allowed  to  have  a  2x2x2  integration  scheme.  Isometric  views  of  the  finite  element  model 
of  the  vehicle  from  a  fixed  vantage  point  are  shown  in  Figure  3a  and  3b.  Figure  3a  depicts 
the  top,  the  front  and  the  left  hand  side  wall  surfaces  while  in  Figure  3b  these  surfaces  were 
removed  as  indicated  by  the  addition  of  intermittent  lines  to  enable  the  viewer  to  have  a 
clear  view  of  the  bottom,  rear  and  far  side  walls  of  the  simulated  model. 

Initially  the  vehicle  was  not  constrained  from  rigid  body  motions  for  displacement  re¬ 
sponse  predictions.  However,  this  condition  caused  large  rigid  body  displacements  and  ro¬ 
tations  which  dominated  the  dynamic  response  and  exaggerated  the  distortional  response. 
Similar  problems  were  encountered  during  acceleration  response  computation.  Large  sliding 
motion  of  the  vehicle  for  the  unrestrained  model  would  require  correction  by  subtracting  the 
rigid  body  displacement  and  rotation  of  the  center  of  mass  of  the  vehicle  from  the  overall 
response.  For  subsequent  studies  a  restrained  model  with  5  constrained  degrees  of  free  Jom  at 
three  corner  nodes  on  the  bottom  surface  was  used  for  computation  of  the  transient  response 
at  critical  locations  of  the  vehicle  since  rigid  body  acceleration  response  could  not  be  easily 
ascertained  and  eliminated  from  the  overall  response. 

Due  to  a  lack  of  ground  dynamic  friction  coefficient  data,  the  restraining  effect  of  the 
ground  friction  upon  the  vehicle  track  and  wheels  at  the  contact  zone  could  not  be  accurately 
modeled.  However,  the  problem  could  be  bracketed  between  two  limiting  conditions  of 
unrestrained  sliding  and  rotation  corresponding  to  zero  ground  friction  and  fixed  restraint 
corresponding  to  a  peak  ground  friction  allowing  almost  no  sliding  which  could  be  achieved 
by  fixing  appropriate  degrees  of  freeom  at  the  bottom  surface  corner  nodes  of  the  vehicle. 
The  limiting  condition  of  fixed  restraint  has  been  represented  in  this  current  study. 

5.  MATERIAL  MODEL 

During  the  process  of  model  generation  material  property  identification  numbers  are 
assigned  to  the  element  groups  as  they  are  created.  In  the  current  investigation  only  one 
material  identification  was  used  to  designate  all  groups  of  elements  in  the  entire  hull  con¬ 
structed  from  5083  aluminum.  Thickness  of  each  shell  element  equalled  the  hull  thickness  of 
3.175  cm  which  remained  uniform  throughout  the  structure. 

For  the  transient  response  analysis,  a  linear  elastic  isotropic  material  model  for  the  shell 
elements  in  the  ADINA  code  was  used.  The  finite  element  analysis  employed  the  following 
values  for  5083  aluminum:  Young’s  modulus  =  68,950  MPa.  Poisson’s  ratio  =  0.33,  and 
mass  density  =  2.7  gr/cm^. 


Node  134  -  Center  of  Loading. 

Node  141  -  Instrument  Panel  Attachment  Point.  Node  283  -  Driver's  Seat  Location. 

Nodes  49,62,72  -  Periscope  Attachment  Points.  Node  306  -  Commander's  Seat  Location. 

Figure  3a.  Top  and  side  wall  node  locations.  Figure  3b.  Node  locations  at  the  bottom  floor. 

Figure  3.  Critical  node  locations  for  the  finite  element  model. 
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Figure  4.  Side  wall  deformation  at  .0022  s  from  impact. 


6.  DYNAMIC  RESPONSE  ANALYSIS 


Prior  to  undertaking  a  dynamic  response  analysis  of  the  structure  subjected  to  transient 
loads,  a  free  vibration  analysis  was  conducted  and  the  first  20  mode  shaptes  and  eigenfre- 
quencies  were  determined  using  the  lumped  mass  as  well  as  the  consistent  mass  formulations 
in  the  ADINA  code.  This  study  was  useful  for  verification  of  the  finite  element  model  of  the 
APC  and  indicated  the  type  of  deformation  and  the  length  of  time  it  takes  to  respond  to 
an  unit  impulse  from  an  analysis  of  the  mode  shapes  and  time  periods  of  oscillation  of  the 
structure. 

The  dynamic  analysis  focused  on  predicting  structural  deformation  of  the  hull  due 
to  a  side-on  impact  load.  Additionally,  location,  magnitude  and  time  of  occurrence  of 
peak  displacements  and  accelerations  at  critical  locations  near  sensitive  equipments  and 
crew  positions  are  of  interest  due  to  ballistic  shock  propagation  and  lethal  damage  to 
coupled  secondary  systems.  A  nondestructive  methodology  for  the  assessment  of  vulner- 
abilty/survivability  of  armored  vehicles  and  personnel  carriers  subjected  to  ballistic  shock 
damage  to  the  primary  structure  and  propagation  to  the  attachment  points  of  coupled  sec¬ 
ondary  sensitive  systems  due  to  nonpenetrating  impact  and  close-in  blast  loads  needs  to  be 
developed  cis  a  useful  predictive  tool  from  systematic  reduction  of  time  domain  response  to 
shock  spectra  at  critical  locations. 


6.1  Results  and  discussions 

The  time  domain  response  analysis  was  conducted  using  the  Newmark  implicit  integrar 
tion  scheme  and  the  lumped  mass  as  well  as  the  mode  superposition  method  using  the  first 
100  modes  with  the  ADINA  code  in  which  we  used  a  constant  time  step  of  .0001  s  for  the 
first  2000  cycles  corresponding  to  a  total  response  time  of  0.2  s.  Figure  4  shows  the  deforma¬ 
tion  response  at  .002  s  corresponding  to  the  occurrence  of  peak  displacement  at  the  impact 
point.  The  scale  chosen  for  the  hull  configuration  is  1/30  and  the  magnification  factor  used 
for  deformation  plot  is  1.0.  Considerable  deformation  of  the  vehicle  hull  can  be  observed 
at  the  sidewall  in  the  vicinity  of  the  impact  point  at  node  134  where  a  concentrated  time 
dependent  step  load  of  3382  KN  was  applied  initially  and  maintained  at  a  constant  level  for 
a  total  duration  time  of  .002  s. 

For  the  model  with  lumped  mass  formulation,  the  maximum  displacement  at  node 
134  was  computed  to  be  9.738  cm  (3.834  in)  which  occurred  at  .0022  s.  This  compares 
favorably  with  the  predicted  peak  resultant  displacement  of  9.726  cm  (3.829  in)  occurring 
at  .002  s  at  the  same  location  of  the  basic  vehicle  hull  with  no  cutouts  using  the  lumped 
mass  formulation.  The  code  predicted  displacement  and  acceleration  responses  at  specified 
node  points  in  all  three  directions.  However,  the  responses  along  the  lateral  X-direction 
were  consistently  at  lecist  an  order  of  magnitude  higher  than  those  along  the  longitudinal 
and  vertical  directions  of  the  vehicle.  This  is  caused  by  the  side-on  impact  load  which 
is  predominantly  in  the  transverse  thickness  direction  acting  normally  upon  the  sidewall. 
The  displacement  and  acceleration  responses  of  the  left  sidewall  at  node  134  along  the  X- 
coordinate  direction  obtained  using  the  lumped  mass  formulation  are  shown  in  Figure  5. 
Peak  displacement  and  a.  -eleration  responses  occur  at  early  times  followed  by  lateral  elastic 
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tigure  5.  Displacement  and  acceleration  responses  at  node  134  along  the 
X-coordinate  using  the  lumped  mass  formulation  only. 
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oscillations. 

The  acceleration  time  domain  data  was  analyzed  and  converted  to  a  frequency  domain 
shock  response  spectrum  using  a  recursive  Altering  procedure  [9].  A  time  step  of  0.0001  s  was 
selected  for  this  computation  .  The  dashed  curve  corresponds  to  a  critical  damping  factor  of 
0.05  while  the  continuous  curve  employs  a  critical  damping  factor  of  0.01.  The  influence  of 
damping  on  the  response  spectra  is  clearly  demonstrated  in  this  figure.  The  shock  spectrum 
analysis  from  the  modal  superposition  method  using  first  100  modes  indicates  that  the  peak 
acceleration  response  is  contained  within  the  initial  frequency  range  from  0-500  Hz  followed 
by  an  uniformly  spaced  sequence  of  harmonics  continuing  at  higher  frequency  levels.  Results 
of  similar  computation  using  modal  superposition  and  the  first  hundred  vibratory  modes 
exhibit  similar  trends  as  shown  in  Figure  6.  Displacement  and  acceleration  responses  as  well 
as  an  analysis  of  the  shock  spectra  along  the  X-coordinate  at  node  location  141  which  is  the 
site  of  the  attachment  point  for  the  driver  instrumentation  panel  to  the  left  sidewall  obtained 
using  the  mode  superposition  method  in  ADINA  are  shown  in  Figure  7.  In  this  case  the 
peak  acceleration  response  appears  to  be  concentrated  in  the  low  frequency  range  of  0-500 
Hz.  Computations  using  both  mode  superposition  and  lumped  mass  formulations  at  several 
other  critical  locations  remote  from  the  impact  point  have  been  completed.  However,  the 
predictions  could  not  be  included  here  due  to  a  lack  of  space  and  will  be  presented  later. 

Comparison  of  shock  spectra  at  critical  locations  remote  from  the  cutout  regions  be¬ 
tween  the  hull  model  with  access  openings  and  the  basic  hull  without  cutouts  show  very  little 
change  in  shock  response  characteristics  since  shock  propagation  from  the  impact  point  to 
these  remote  locations  such  as  the  driver’s  and  the  commander’s  seat  locations  projected  to 
the  bottom  floor  remains  relatively  unaffected.  However,  significant  change  in  peak  accelera¬ 
tion  magnitudes  and  shock  response  characteristics  could  be  observed  in  all  three  coordinate 
directions  at  the  three  periscope  locations  in  the  vicinity  of  the  simulated  cutout  for  the 
driver’s  hatch  opening  and  also  to  some  extent  at  the  instrument  control  panel  attachment 
point  at  the  front  left  hand  side  wall.  These  are  caused  possibly  by  the  alteration  of  shock 
propagation  path  to  the  critical  locations  due  to  the  presence  of  cutouts  in  the  immediate 
neighborhood  contributing  to  reduction  in  mass  and  stiffness  which  also  affect  the  transient 
response  of  the  vehicle  at  these  locations. 


6.2  Conclusions 

Modern  combat  vehicles  are  increasingly  carrying  a  variety  of  sensitive  components 
which  are  susceptible  to  shock  damage.  The  transmission  of  shock  through  the  structure 
to  critical  components  considerably  remote  from  the  impacted  zone  can  result  in  the  loss  of 
combat  capability  inspite  of  crew  survivability  and  retention  of  structural  integrity  of  the 
vehicle. 

As  a  first  step  towards  the  development  of  a  vulnerability  assessment  methodology 
to  predict  damage  to  critical  components  from  transient  loads,  a  simplified  model  (1738 
degrees-of-freedom)  was  generated  for  use  in  the  ADINA  code  to  compute  the  response 
of  the  U.S.  Army  Ml  13  personnel  carrier.  A  shock  response  analysis  was  applied  to  the 
acceleration  histories  to  obtain  shock  spectra  at  these  locations.  The  spectra  at  critical 
locations  indicated  that  the  simplified  model  contained  no  structural  frequencies  above  5 
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Displacement  and  acceleration  responses  at  node  141  along  the 
X-coordinate  using  mode  superposition  and  lumped  mass  formulation. 
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KHz.  Additionally,  the  spectrum  at  the  impact  point  exhibited  an  uniformly  spaced  sequence 
of  harmonics  continuing  beyond  the  cut-off  frequency  of  5  KHz  and  having  periods  equal  to 
multiples  of  the  load  duration.  Calculations  were  repeated  using  modal  superposition  based 
on  the  first  100  vibratory  modes  with  a  cut-off  frequency  of  approximately  500  Hz.  Again  no 
structural  frequencies  higher  than  the  500  Hz  cut-off  were  detected,  but  the  same  sequence 
of  harmonics  were  ol^rved  at  the  impact  point.  These  harmonics  are  an  expected  artifact  of 
the  load  duration  and  has  time  periods  which  are  multiples  of  the  the  impact  load  duration 
time  of  0.002  s.  The  modal  superposition  calculation  demonstrated  that  while  the  deflection 
histories  can  be  accurately  reproduced  with  the  first  100  modes,  the  acceleration  histories 
and  shock  spectra  may  require  considerably  higher  number  of  modes.  Some  problems  with 
aliasing  resulting  from  discrete  sampling  of  time  signals  were  encountered  but  could  be 
avoided  by  increasing  the  sampling  rate.  The  study  indicates  that  the  time  integration  and 
superposition  n^thods  in  combination  with  the  shock  spectral  techniques  can  be  useful  in 
predicting  damage  due  to  shock  propagation  in  structures. 

Comparison  of  response  of  the  model  with  access  openings  with  that  of  the  basic  vehicle 
hull  with  no  cutouts  indicates  a  small  shift  of  the  eigenfrequencies  to  somewhat  higher 
natural  frequencies  with  very  minor  alteration  in  corresponding  modeshapes  for  the  cutout 
model  possibly  due  to  the  mass  loss  in  the  cutout  regions.  Both  frequency  domain  and 
time  domain  response  at  critical  locations  remote  from  the  cutout  regions  for  the  vehicle 
model  with  access  holes  compare  favourably  with  the  response  of  the  basic  vehicle  since 
propagation  of  shock  to  these  regions  remain  unaffected.  However,  in  the  vicinity  of  the 
cutouts,  shock  propagation  to  critical  regions  are  significantly  affected  due  to  alteration  of 
the  shock  propagation  path  and  reflection  of  energy  from  the  interface  of  the  cutout  back 
to  the  source  resulting  in  corresponding  influence  on  the  shock  spectral  response  and  visible 
alteration  in  peak  frequency  magnitudes  when  compared  to  those  from  the  basic  vehicle  hull 
without  any  access  openings. 
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ABSTRACT 


We  have  developed  a  quantitative  theory  for  the  mixing  of  fluids 
induced  by  a  random  velocity  field.  The  theory  provides  a  quantitative 
prediction  for  the  growth  of  the  mixing  region.  There  are  three  distinct 
regimes  for  the  asymptotic  scaling  behavior  of  the  mixing  layer,  depend¬ 
ing  on  the  asymptotic  behavior  of  the  random  velocity  field.  The  asymp¬ 
totic  diffusion  is  Fickian  when  the  ctxrelation  fimction  of  the  random  field 
decays  rapidly  at  large  length  scales.  Otherwise  the  asymptotic  diffusion  is 
non-Fickian.  The  scaling  behavior  of  the  mixing  layer  driven  by  a  general 
random  velocity  field  is  determined  over  all  length  scales.  Our  results 
show  that,  in  general,  the  scaling  exponent  of  the  mixing  layer  is  non- 
Fickian  on  all  finite  length  scales,  hi  the  Lagrangian  picture,  due  to  the 
non-linearity  of  the  effective  dynamical  equation  derived  from  file  Taylrv 
diffusion  theory,  the  mixing  layer  is  not  a  firactal  even  if  the  random  velo¬ 
city  field  is  a  fractal. 


Introduction 

The  study  of  the  mixing  induced  by  random  fields  becomes  increasingly  important 
in  the  study  of  fully  developed  turbulence,  enhanced  oil  recovery  processes  and  ground 
water  ecology.  For  a  tracer  flow  through  a  random  velocity  field,  a  mixing  layer  is 
devel(^>ed  between  the  tagged  and  the  untagged  region.  The  mixing  region  expands  as 
time  evolves.  In  the  case  of  ground  water  ecology,  the  random  velocity  field  is  caused  by 
the  random  permeability  field  through  the  Darcy’s  law.  The  purposes  of  our  study  is  to 
predict  the  statistical  properties  of  the  fluids,  such  as  the  size  of  the  mixing  regime,  or  the 
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efifective  macroscopic  diffusion  coefficient,  from  the  statistical  properties  of  the  randcan 
field.  Let  lit)  be  the  size  of  the  mixing  at  the  time  r.  How  does  /  grow  as  a  function  of 
rimft  r?  At  what  rate  will  /  grow  at  very  large  time?  In  general.  /  can  take  a  quite  general 
functional  form.  Let  us  first  define  the  scaling  e^qxtnent  of  a  general  function.  For  any 
positive  differentiable  function  /  it),  we  expr^  /  (r)  in  a  multi-length-scale-fractal  form 

/(0=n  (1) 

where 

=  (2) 

aim 

is  the  multi-length-scale  fractal  e}q)onent  at  scale  t  and  a  it),  determined  by 

ta<i(0  =  ln/(/)-lii(<)^^.  (3) 

is  the  multi-fractal  coefficient  of  fit).  In  general,  the  exponent  of  a  function  may  vary 
with  the  scale.  If  h  (r)  is  constant,  then  it  follows  fiiat  a  it)  is  also  a  constant.  The  result  is 
a  pure  power  law.  In  this  case,  /  (0  is  a  fractal  in  the  sense  that  results  obtained  from  dif¬ 
ferent  scales  are  related  by  a  simple  scaling.  Knowing  the  value  of  /  at  a  given  length 

scale  and  its  scaling  exponent  b,  the  value  of  /  over  all  length  scales  can  be  determined 

by  a  simple  scaling.  However,  the  fractal  treatment  of  the  mixing  layer  is  for  simplicity, 
rather  than  necessity.  Even  if  we  take  the  permeability  field  as  a  fractal  random  field,  the 
velocity  field  obtained  from  Darcy’s  may  not  be  a  fractal  field,  except  fm  a  weak  hetero¬ 
geneity  where  the  linear  (in  terms  of  the  fluctuation  of  the  permeability  field)  solution  of 
Darcy’s  law  can  be  used.  In  general,  the  statistical  pr(^)erties  of  a  random  field  can  vary 
as  the  length  scale  changes.  Therefore  a  multi-fractal  theory  is  more  suitable  for  studies 
involving  a  wide  range  of  length  scales,  such  as  data  including  both  laboratory  experi¬ 
ments  and  field  tests  [21].  The  length  scale  in  the  laboratory  is  on  the  order  of  a  fraction 
of  a  meter  and  the  length  scale  of  the  interwell  spacing  in  an  oil  field  may  be  on  the  order 
of  a  kilometer. 

By  applying  the  renormalization  group  perturbation  themy,  we  have  developed  a 
multi-length-scale-fractal  theory  for  the  mixing  of  the  fluids  induced  by  random  fields 
[14,24,25,28].  The  theory  provides  an  explicit  analytic  prediction  for  the  growth  of  the 
mixing  regime  of  the  fluids  and  an  effective  macroscq>ic  diffusion  coefricient  induced  by 
random  velocity  fields,  or  the  heterogeneous  permeability  fields,  over  all  time  scales. 
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The  results  of  the  exact  numerical  computation  using  the  front  tracking  method  agree 
with  the  prediction  of  the  theory  very  well  over  all  time  scales  without  any  fitting  para- 
menters  [7-11]. 

The  study  of  diffusion  induced  by  random  field  has  a  long  history.  Many  methods 
have  been  developed.  See  [1-6,12,13,15-20,22,23]  for  fiirfiier  references. 

Asymptotic  Scaling  Relation 

The  relationships  among  the  asymptotic  exponents  of  the  fluid,  diffusion  coefficient 
and  random  field  have  been  by  several  different  methods  [14,24,25,28].  These  methods 
gave  the  same  results.  The  results  are 

1  floe  CCfloD 

Yoo  =  max{  y ,  1+“^*  1  +  )}  W 

and 

Yoo  =  max{0,  1  +  floo,  1  +  afloo }.  (5) 

Here  yoo,  y-  and  P<»  are  the  asymptotic  scaling  exponents  of  the  mixing  regime,  of  the 
effective  macroscopic  diffusion  coefficient,  and  of  the  cmrelation  function  of  the  velocity 
(or  permeability)  field  respectively,  a  is  a  parameter  that  characterizes  the  rate  of  remo¬ 
val  of  the  infrared  cut-off.  Equations  (4)  and  (5)  show  that  there  are  three  regimes  with 
distinct  scaling  behavior.  These  three  regimes  are  connected  by  two  critical  points: 
flo.  =  -1  and  flea  =  0.  For  an  asymptotically  rapidly  decaying  velocity  correlation  func¬ 
tion,  fleo  <  -1,  the  asymptotic  diffusion  is  Pickian.  This  regime  is  characterized  by  the 
properties  that  the  correlation  function  is  integrable  over  one  spatial  dimension  and  that  a 
finite  asymptotic  limit  exists  for  the  diffusion  coefficient.  For  asymptotically  slowly 
decaying  velocity  correlation  function,  -1<  floo  <  0,  the  asymptotic  scaling  is  non- 
Fickian  and  the  diffusion  coefficient  diverges  at  large  time.  The  analysis  of  the  field  data 
shows  that  this  regime  is  important  for  ground  water  ecology.  The  third  regime  is 
specified  by  0  <  fl.  In  this  regime,  the  scaling  is  non-Fickian.  A  striking  new  result  has 
been  obtained  in  this  regime:  the  asymptotic  scaling  is  non-unique  before  specifying  how 
the  infra-red  cutoff  is  removed  from  the  system.  [25,28].  In  other  words  the  scaling  rela¬ 
tion  contains  a  family  of  solutions  which  depends  on  the  rate  of  removal  of  infrared  cut¬ 
off.  Such  non-uniqueness  is  intrinsically  due  to  the  fact  that  infrared  cutoff  of  the  corre¬ 
lation  fimction  of  the  random  velocity  field  is  not  removable  for  0  <  fl*.  As  a  tends  to 
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infinity,  which  corresponds  to  the  fast^t  removal  of  the  infrared  cuU^,  the  scaling 
exprment  tends  to  infinity  as  well. 

For  a  laminar  shear  flow  model  with  a  fractal  Gaussian  random  velocity  field,  the 
problem  can  be  treated  exactly  [3,28].  It  has  been  shown  that  the  solution  given  in  [3] 
corresponds  to  a  particular  solution  with  a  specific  infrared  cutoff  among  the  family  of 
solutions  given  in  [28].  In  the  case  of  an  infrared  non-removable  systmn,  any  specific 
choice  for  the  infrared  cutoff  may  ^ve  spurious  phase  boundaries  in  the  asymptotic  scal¬ 
ing  laws. 

Transient  Scaling  Behavior 

The  scaling  laws  given  by  (4)  and  (S)  are  valid  for  very  large  scales  only.  We  now 
consider  the  scaling  laws  on  finite  length  scales.  In  the  Eulerian  picture,  the  motimi  of 
the  tracer  flow  can  be  modeled  by  a  stochastic  linear  transport  equation 

s,+-^Vs  =  diV^s,  (6) 

where  s  is  the  saturation  value  of  the  fluid,  so  that  s  =  1  fw  tagged  fluid  and  s  s  0  for 
untagged  fluid.  V  is  the  spatial  gradient  operation.  di  is  the  local  molecular  diffusion 
coefficient,  l^is  the  velocity  of  the  fluid.  We  assume  the  velocity  field  is  random  and  sta¬ 
tionary.  Therefore  it  is  a  function  of  the  spatial  variables  only.  Fot  tracer  flow,  the  velo¬ 
city  field l^is  determined  from  Darcy’s  law  and  the  condition  of  incompressibility.  Each 
individual  solution  for  a  given  realization  of  the  random  variable  does  not 

play  a  significant  role.  The  statistical  behavior  of  the  tracer  is  determined  from  the 
ensemble  mean  saturation  value  <sit^>.  Although  (6)  is  a  linear  equation  with  respect 
to  the  independent  variables  t  and  '3^  it  is  a  non-linear  equation  respect  to  the  random 
fields  "i^and  s.  The  effective  equation  for  <s>  will  not  be  obtained  from  (5)  in  a  trivial 
way.  It  has  been  shown  that  the  effective  equation  for  <s  >  can  be  written  as  [14, 24, 25] 

+  cf/V2<5(ii,/)>  +  0(6/).  (7) 

Herel^O  s  is  a  constant  vector  and  is  determined  by  the  steady  pressure  drt^  applied 
to  the  fluid  field.  <■>  denotes  ensemble  average.  ^  fluctuation  of  the 
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random  velocity  field  due  to  the  rock  heterogeneity  and  is  a  function  of  the  q)atial  vari¬ 
ables  only  for  a  stationary  velocity  field.  The  shape  of  die  tracer  interface  at  r  =  0  is 
given  by  For  simplicity  we  assume  that  the  initial  interfroe  between  tagged  and 

untagged  fluid  is  a  plane  with  its  normal  direction  pointing  along  the  direction  of  l^o- 
Then  the  effective  equadmi  in  the  longitudinal  direction  is  given  by  [14, 24. 27] 

+%d<s(t.x)>  =  c  ^  <5  +  0(5v^).  (8) 

Here 


ait)  =  <6v  1  (vo/)6v  1  ivQt')>  J[j'<?(vo(t  -  O)  +  d/  (9) 

is  the  effective  longitudinal  diftiision  coefticient.  5v  i  is  the  Imigitudinal  component  of 
the  fluctuation  of  the  random  velocity  field.  ^  is  the  longitudinal  velocity  correlation 
function.  In  (19)  we  haw  assumed  diat  the  statistical  prqierties  of  the  random  velocity 
are  translational  invariant.  Therefore  the  C(m‘elation  function  of  the  velocity  field 
depends  only  on  the  relative  separation  between  the  two  points.  In  the  limit  of  weak 
fluctuations,  the  term  prq>ortional  to  5v^  is  negligible.  Then  for  the  planar  initial  data, 
the  solution  to  (8)  is  given  by 

1  X-Vnt 

<5(ar,/)>  =  ^erfc(-y^).  (10) 

where  erfcQ  the  complimentary  eirtH*  function  and 


/  (0  =  =  2[jf  (t  -  t')a{t')dt'  +  ditf^.  (1 1) 

Equation  (10)  shows  that  the  solution  <s{x,t)>  scales  as  /  (r).  It  follows  that  the  mixing 
zone  scales  as  /(r)  as  well.  We  define  l{t)  as  the  mixing  length,  the  size  of  the  mixing 
zone.  From  definition  (2),  scaling  exponent  of  the  mixing  regime  is  given  by 


7(0  = 


rf(ln(;(0))  ^  if.  _ 

dm))  2^  ■ 


and  the  scaling  exponent  of  the  effective  diffusion  coefficient  is  given  by 


V(0  = 


d(ln(«(r)))  _  1  r 

dm))  2  ^ 


t^(l^K)dK  +  dit 


(12) 


(13) 
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The  first  two  regimes  of  the  asymptotic  scaling  relations  shown  in  (4)  and  (S),  i.  e. 

% 

max{— .  1+“^}  max{0,  1  +  p,.  }  can  be  determined  from  (12)  and  (13)[14,24].  It 
2  2 

can  be  shown  that,  at  the  critical  point  p«.  =  -1,  the  system  has  the  slowest  rate  for 
approaching  its  asymptotic  exponent  [24],  Equations  (12)  and  (13)  are  only  valid  for  the 
systems  in  which  the  infrared  cutoff  is  removable.  In  the  case  where  the  infrared  cutoff 
can  not  be  completely  removed,  one  must  consider  the  velocity  correlation  function  in 
Fourier  space  with  an  infrared  cutoff.  Then  the  asymptotic  scaling  exponents  depend  on 
the  rate  of  the  removal  of  the  infrared  cutoff  and  a  family  of  solutions  exist.  See  [28,25] 
fOT  the  analysis  of  asymptotic  scaling  exponent  of  the  infrared  non-removable  system. 

Equations  (12)  and  (1 3)  show  that  the  exponent  of  the  mixing  length  and  that  of  die 
diffusion  coefficient  at  length  scale  VQt  depend  on  all  length  scales  less  than  VQt.  This 
explains  the  dependence  of  the  mixing  length  exponent  on  the  flow  history.  In  general, 
the  ratio 

is  non-zero  for  finite  t  unless  q  is  proportional  to  a  delta  function.  It  follows  from  (12) 
and  (13)  that,  in  general,  the  scaling  of  the  mixing  zone  is  non-Fickian  or  finite  length 
scales  unless  the  random  velocity  is  a  white  noise. 

Equations  (4)  and  (5)  show  that  molecular  diffusion  does  not  play  a  role  in  the 
asymptotic  scaling  exponents  of  the  mixing  length  and  diffusion  coefficient.  The  molec¬ 
ular  diffusion  is  important  on  small  length  scales  only.  Assuming  q  is  non-singular  on 
small  length  scales,  then  from  (12)  we  have  ■yfO''  =  1/2  when  d/  ^0,  and  ^(0)  =  1  when 
di  =  0.  For  small  length  scales,  (15)  gives 

q{Q)t  +  di 
qi0)(  +  2di' 

Therefore  at  the  length  scale  vqI  >  VQdilq{0)  the  molecular  diffusion  is  negligible  and 
Y(r) «  1.  The  data  set  given  in  [21]  shows  that  scale  exponent  y  is  about  1  at  the  smallest 
length  scale  in  the  given  data.  This  implies  that  the  local  molecular  diffusion  is  negligi¬ 
ble  at  all  length  scales  for  the  given  data  set,  including  the  short  length  scale  data 
obtained  from  laboratory  experiments.  Equation  (12)  also  shows  that  for  a  given 
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correlation  function  and  length  scale  vq/.  y  ^  &  decreasing  function  of  the  strengdi  of  die 
molecular  diffusion.  In  other  words,  for  fixed  t,  the  larger  d/  is,  the  smaller  y  is.  This  is 
due  to  the  fact  that  the  constant  local  molecular  diffusion  has  Fickian  scaling  while  the 
diffusion  induced  by  a  random  velocity  has  non-Fickian  scaling  (except  when  the  random 
velocity  is  a  white  noise). 

The  properties  discussed  here  are  in  good  agreement  with  the  general  features  of  the 
laboratory  and  field  data  [21].  This  data  set  shows  that  the  exponent  of  the  mixing  layer 
is  about  1  at  short  length  scales,  and  is  less  than  1  at  large  length  scales.  It  also  confirms 
that  the  exponent  of  the  mixing  region  is  larger  than  1/2,  i.e.  non-Fickian  over  all  length 
scales  supplied  by  the  data  set. 

The  main  features  of  the  multi-fractal  theories  relative  to  the  firactal  theories,  are 
the  allowance  of  a  length  scale  dependence  of  physical  quantities  and  the  inclusion  of  the 
transient  effect  between  different  length  scales.  Equations  (12)  and  (13)  show  that,  at 
any  given  length  scale  vq/,  the  scaling  exponent  of  the  mixing  layer  and  that  of  the  diffu¬ 
sion  coefficient  depend  on  the  behavior  of  the  correlation  function  of  the  random  velocity 
on  aU  length  scales  which  are  smaller  than  /.  This  is  a  transient  effect.  Our  analysis  has 
shown  that  two  factors  contribute  to  the  variation  of  the  scaling  exponent  of  the  fluid  at 
different  length  scales.  One  is  the  variation  of  the  scaling  exponent  of  the  random  velo¬ 
city  field  at  current  length  scale.  The  other  is  the  variation  of  the  scaling  exponent  of  the 
random  velocity  field  at  all  length  scales  smaller  than  the  current  length  scale  (a  transient 
effect).  If  the  exponent  of  the  velocity  correlation  function  /  varies  slowly  over  a 
sufficient  large  range  of  length  scales,  then  the  exponent  of  the  mixing  layer,  'y(/), 

approaches  an  instantaneous  fractal  exponent  y^(r)  =  max{y,  1+-^^},  where 

P(r)  =  dhuj  {t)ld\nt.  In  the  asymptotic  regime,  the  scaling  exponent  of  the  velocity  corre¬ 
lation  function  approaches  its  asymptotic  limit,  and  the  transient  effect  from  finite  length 
scales  is  damped.  The  asymptotic  scaling  exponent  of  the  fluid  is  determined  by  the 
asymptotic  scaling  exponent  of  the  random  velocity  field  only.  Therefore  fractal  random 
velocity  field  models  can  be  used  for  the  study  of  the  asymptotic  scaling  relationship 
between  the  fluid  and  the  random  velocity  field  [14,25].  For  a  random  velocity  field  with 
an  asymptotic  scaling  exponent  less  than  -1,  the  asymptotic  scaling  behavior  of  the  fluid 
is  Fickian.  In  this  case,  a  pure  fractal  random  velocity  field  leads  to  an  ultraviolet  diver¬ 
gence.  Any  cutoff,  which  removes  such  divergence,  changes  the  random  velocity  field 
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from  a  fractal  field  to  a  multi-fractal  field.  Since  the  introduction  of  the  ultraviolet  cutoff 
only  affects  the  scaling  behavior  of  the  random  velocity  field  at  short  length  scales,  it 
does  not  affect  the  asymptotic  scaling  relation  between  the  fluid  and  the  random  velocity 
field.  The  situation  is  quite  different  for  a  random  velocity  field  with  an  asymptotic  scal¬ 
ing  exponent  larger  than  zero.  In  this  case  a  pure  fractal  leads  to  an  infrared  divergence. 
Since  the  infrared  cutoff  changes  scaling  behavior  of  the  random  velocity  field  at  large 
length  scale,  it  intrinsically  affects  the  asymptotic  scaling  relationship  between  the  fluid 
and  velocity  field.  Therefore,  for  infrared  divergent  systems,  the  asymptotic  scaling  rela¬ 
tionship  between  the  fluid  and  the  random  velocity  field  depends  on  the  rate  at  which  the 
infrared  cutoff  tends  to  zero  as  shown  in  (4)  and  (5)  [28,25]. 

The  multi-length-scale-fractal  theory  presented  here  has  been  validated  by  the 
exact  numerical  computations  using  the  front  tracking  method  [7-10].  The  comparisons 
between  the  exact  numerical  results  and  the  results  predicted  by  the  multi-length-scale- 
fractal  theory  have  been  made  for  a  system  with  asymptotic  non-Fickian  diffusion 
(poo  =  -0.5)  and  a  system  with  asymptotic  Fickian  diffusion  (Poo  =  -<»).  Different 
strengths  of  heterogeneity  are  studied  for  each  system.  The  studies  show  that  the  predic¬ 
tions  of  multi-length-scale-fractal  theory  agree  very  well  with  the  numerical  results  over 
the  full  range  of  the  length  scales  computed  which  is  up  to  the  length  scale  1(X)  times 
larger  than  the  length  scale  of  the  basic  heterogeneity.  We  comment  that  in  these  studies, 
no  fitting  parameter  is  involved  for  the  results  of  the  multi-length-scale-fractal  theory. 

According  to  (4),  the  asymptotic  scaling  exponent  of  the  mixmg  layer  is  0.75  for 
Poo  =-0.5.  The  scaling  exponent  ^(t;  obtained  from  the  numerical  computations 
decreases  monotonically  from  its  initial  value  close  to  1.0  to  its  final  value  0.85  at  the 
end  of  the  computation.  In  other  words  at  the  length  which  is  100  times  larger  than  the 
length  scale  of  the  basic  heterogeneity,  the  scaling  exponent  still  has  not  reached  its 
asymptotic  limit.  For  P*  =  -«»,  yoo  =  0.5.  That  limit  is  approached  within  10%  at  a 
length  scale  larger  than  50  units  of  the  basic  heterogeneity  length  scale.  Both  these  two 
cases  show  that  the  transient  effects  are  very  important  for  determination  of  the  scaling 
on  finite  length  scales.  See  [7-10]  for  the  detiuls  of  these  numerical  studies. 

The  study  of  the  diffusion  induced  by  random  velocity  field  has  also  been  carried 
out  in  the  Lagrangian  picture  by  applying  Taylor  diffusion  theory  [5,  6,  22,  25,  26]. 
Under  Corrsin’s  hypothesis,  the  effective  equation  for  the  variance  tensor  of  fluctuations 
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of  the  tracer  particle  displacement  is  a  (xnnplicated  non-linear  ordinary  differential  equa¬ 
tion.  Recently,  it  has  been  shown  that  due  to  the  nonlinearity  of  the  elective  equation, 
the  mixing  regime  induced  by  a  random  field  is  not  a  fractal,  even  if  the  random  field 
itself  is  a  fractal  [25,26].  In  addition  to  the  asymptotic  scaling  exponents,  the  asymptotic 
coefficients  in  front  of  the  asymptotic  power  law  in  front  of  the  asymptotic  power  law  for 
the  size  of  mixing  layer  and  effective  diffusion  coefficient  have  also  been  determined  for 
a  fractal  random  velocity  field.  See  [25]  and  [26]  for  details. 
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stability  Analysis  of  Stochastic  PDE*s  via  Lyapunov  Functionals^ 

Pao-Liu  Chow 
Department  of  Mathematics 
Wayne  State  University 
Detroit,  Michigan  48202 

ABSTRACT  The  paper  is  concerned  with  the  stability  of  stochastic  partial  differential  equa¬ 
tion  of  parabolic  Ito  type.  By  the  method  of  Lyapunov  functionals,  we  examine  three  kinds 
of  stochastic  stability:  the  stability  in  probability,  the  asymptotic  moment  stability  and  the 
almost  sure  asymptotic  stabihty.  Sufficient  stability  conditions  are  given  and  some  illustra¬ 
tive  examples  are  provided. 

I.  miRQmjcTiQN 

To  illustrate  some  basic  ideas  in  stochastic  stability,  we  will  first  consider  a  simple  example 
in  stochastic  ordinary  differential  equations  (ODE’s).  The  example  is  given  by  the  following 
ltd  equation  in  one  dimension: 

dxt  =  —axtdi  -f  (TXtdbt,  (1) 

where  a,  a  are  positive  parameters  and  5j  is  the  standard  Brownian  motion  in  one  dimension. 
Clearly  Xt  =  0  is  a  solution.  The  question  is  whether  the  null  solution  is  stable.  To  find  it 
out,  let 

^pt{x)  =  x^,  p>0. 

^This  work  was  supported  by  the  NSF  grant  DMS-91-01360. 
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By  the  ltd  formula  [1],  we  have 


so  that 


or 


=  ^p(xo)  +  2p{i(2p  -  l)<r’  -  o] 

+  2p<T  £  (p^{x,)db„ 

=  <Pp(xo)  +  2p(|(2p  -  l)a’  -  o]  E(pp(x,)ds 


E<Ppixt)  =  V>p(*o)exp{2p{i(2p  -  1)<t’  -  a]<}. 


Thus,  as  t  — »  oc,  we  get 

( 

0,  if  a  >  i(2p- l)a*, 
oo,  if  a  <  3(2p-  1)<T^, 
which  shows  that  the  (asymptotic)  moment  stability  depends  on  the  order  2p.  For  instance, 


E(pp{xt) 


for  p  =  1,  the  null  solution  is  mean-square  stable  if  a  =  2<7*,  but  is  not  4th-moment  (p  =  2) 
stable.  In  fact,  for  any  given  a  and  <r,  there  exists  a  m  >  0  such  that  the  null  solution  is 
mth- moment  unstable.  On  the  other  hand,  the  equation  (1)  has  the  ex2u:t  solution: 

xt  =  xo€xp{abt  -  (o  -I- 

By  the  strong  law  of  large  numbers,  we  have  (b(/t)  — ♦  0  almost  surely  (a.s.)  so  that 

Xt  — »  0  a.s.  as  t  — »  oo, 

for  a  >  Hence  the  null  solution  is  a.s.  as}miptotically  stable  for  any  a  >  0,  (t  >  0. 

In  contract  with  a  deterministic  equation,  this  simple  example  illustrates  many  faces  of 
a  stochastic  stability  problem.  For  instance,  the  asymptotic  stability  can  be  discussed  in 
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the  sense  of  probability,  in  the  pth-moment  or  in  the  almost  sure  sense.  Which  mode  of 
convergence  we  choose  depends  on  the  nature  of  the  problem  under  consideration. 

Now  consider  a  general  Ito  equation  in  one  dimensi< 

dxt  =  a{xt)dt  +  a{xt)dbt,  (2) 

where  the  functions  a  and  a  are  sufficiently  smooth  with  a(0)  =  cr(0)  =  0.  Again  it  =  0  is  a 
solution  of  Eq.  (2).  Since  the  equation  is  no  longer  solvable  in  a  closed  form,  to  determine 
the  stability  of  the  null  solution,  other  approach  is  needed.  As  in  the  deterministic  case,  a 
qualitative  method  based  on  the  Lyapunov  function  has  been  employed  by  many  workers  to 
study  the  stochastic  stability.  For  a  systemetic  exposition  of  this  subject,  one  is  referred  to 
the  excellent  book  by  Khasminskii  [2].  Since  the  emergence  of  stochastic  partial  differentiaLl 
equations  (PDE’s)  in  1970’s,  the  corresponding  stability  questions  have  arisen  naturally. 
In  1982  the  author  introduced  the  method  of  Lyapunov  functionals  to  study  the  stability 
of  stochastic  PDE’s  [3].  About  the  same  time  Ichikawa  adopted  a  similsur  apporach  to 
analyze  the  stability  and  related  questions  [4].  Recently  Khasminskii  and  Mandrekar  [5]  have 
examined  the  linearized  stability  problems  of  stochatic  PDE’s  by  the  Lyapunov  method. 

In  this  paper  we  shall  first  introduce  stochastic  PDE’s.  Then  we  review  some  basic  results 
in  stochastic  PDE’s  and  define  the  Lyapunov  functional.  Subsequently,  via  the  Lyapunov 
functional  approach,  we  present  some  stability  results  in  the  sense  of  probability,  the  asymp¬ 
totic  stability  in  pth-moment  and  the  almost  sure  asymptotic  stability  of  some  nonlinear 
stochastic  PDE’s.  Some  examples  will  also  be  given  for  the  purpose  of  illustration. 

II.  STOCHASTIC  PDE’S 

Let  D  =  (0,1)  be  a  unit  interval  and  let  K  =  L^{D)  be  the  Hilbert  space  of  square- 
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integrable  functions  on  D.  For  <p  £  K,  define  the  integral  operator  Q  by 


{Qtp){x)  =  [  q(x,y)  <piy)dy,  (3) 

JD 

where  the  kernel  q{x,  y)  is  symmetric  and  positive  so  that 

Tr.Q  =  j^q{x^x)dx  <  cx>.  (4) 

Let  {e„}  be  the  set  of  orthonormal  eigenfunctions  of  Q  with  eigenvalues  A„,  n  =  1,2, . . . 
By  (4),  we  have  A„  >  0  and 

Er=iAn  <  oo. 

Denote  by  W{t,x)  the  Q- Wiener  process  in  /if  if 

W,  =  W(t,..)  =  E-i6rc«,  (5) 

where  {6J*}  is  a  sequence  of  independent,  identically  distributed  (i.i.d.)  Brownian  motions 
in  one  dimension.  Then  we  have 


EW{t,x)  =  0  (6) 

and 

E{Wt,9){W„<p)  =  E  I  W{t,x)9{x)ds  f  W{t,y)<p{y)dy 

JD  Jd 

=  S~iA„(c„,0)(e„,v?)(t  As) 

=  (tAs)((?0,<^),  (7) 

where  use  was  made  of  the  fact  Eb^b^  =  (<  A  s)6mn}  and  {t  As)  =  min{<,s}.  The  Gaussian 
process  Wt  satisfying  the  properties  (6)  and  (7)  is  called  a  ^-Wiener  process  in  K. 
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As  an  example  of  stochastic  PDE,  consider  the  reaction  •  diffusion  equation  with  a  random 
drift: 

u{0,x)  =  <p{x)  (8) 

u(<,0)  =  u(f,l)  =  0, 

where  Wt  —  H^(/,  •)  is  a  Q- Wiener  process  defined  as  above  and  W  =  is  formally  a 
A’-valued  white  noise.  Introduce  H  =  L^(D)(=  K  in  this  case),  with  inner  product  (•,  •)  and 
norm  ||  •  ||.  Let  V  =  Hq{D)  =  {(f  e  H  :  ^  E  H  and  ^(0)  =  ¥>(1)  =  0}  with  norm  ||  •  ||i  and 
let  V'  be  the  dual  space  of  V .  For  v  E  V  and  v*  E  V\  the  linear  functional  v*  evaluated  at 
V  is  written  as 

«*(«)  =s<  u*,  V  >  . 

Now  we  define  the  operator  A  and  B  as  follows: 

A/  \  tt 

>»(>>)  =  + 

B(v)  = 

If  /(x,  y)  is  bounded  and  smooth,  we  have  A:V  —*V'  and  B  :V  -*  C^{K,  H),  which  denotes 
the  space  of  Hilbert-Schmidt  operators  from  K  into  H.  By  using  the  above  notations  and 
setting  Ut  =  u{t,  •),  Eq.  (8)  can  be  regarded  as  an  Ito  equation  in  V  : 

I  dut  =  A{ut)dt  +  B{ut)dWt, 

y 

.  Wo  =  V?, 

or  as  an  integral  equation 

Ut  =  V?  +  /  A(u,)ds  +  f  B(u,)dW,, 

Jo  Jo 
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where  (p  ^  H  and  the  last  integral  is  a  ^-valued  Ito  integral.  The  above  is  a  special  case  of 
a  large  class  of  quasilinear  parabolic*  Ito  equations  in  a  domain  D  C  which  have  been 
studied  by  many  authors.  For  references,  see  Pardoux  [6],  Krylov  and  Rozovskii  [7],  DaPrato 
and  Zabczyk  [8]. 

111.  LYAPUNOV  FUNCTIONALS 

In  general  let  V  and  H  be  real  separable  Hilbert  spaces  such  that  V  C  H  C  V'  and 
the  inclusions  are  dense  and  compact.  By  using  the  same  notations  as  in  §11,  consider  the 
following  Ito  equation  in  V': 

,■  dut  =  A{ut)dt  +  B{ut)dWt, 

1  “0  =  (9) 

where  >4(0)  =  B{Q)  =  0.  Under  suitable  conditions,  the  above  equation  has  a  unique  strong 
solution  Uf  satisfying  [6]: 

T 

E  sup  ||uf II’  +  E  f  llufllidf  <  oo,  for  any  7*  >  0  and  p  >  2.  (10) 

o<«<T  Jo 

As  in  finite-dimension,  the  Ito  formula  plays  an  important  role  in  stochastic  analysis.  For 
a  smooth  function2Ll  $  on  /f ,  let  D^(v)  and  Z?’$(u)  denote  the  first  and  the  second  Frechet 
derivatives,  respectively.  Then  the  Ito  formula  associated  with  Eq.  (9)  reads 

$«)  =  ^(V’)  +  f  C^u‘:)ds  +  B{u':)dW,),  (11) 

JO  Jo 

where 

£$(«)  =  ^Tr.[D^^v)B{v)QB*{v)]+  <  A{v),  D^v)  >  .  (12) 
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Let  us  give  two  exaunples: 
Ex.l)  $(u)  =  ||vf,  veH. 
Then 


D^{y)  =  2v, 

=  2/, 

where  I  is  the  identity  operator  on  H. 

We  get 

£$(«)  =  Tr.[B{v)QB*{v)]  +  2  <  >l(u),i;  >  . 

Ex. 2)  $(v)  =  ^n||w||,  u  /  0. 

We  have 

D^v)  =  v/||t;||^ 

DH{v)  =  (//|Mn-(t;(8)v)/||vr 

where  ®  denotes  the  tensor  product. 

Hence 

CHv)  =  ^(irr.[B(<,)0B»l+ <>!(>,), v>) 

Let  be  a  regular  functional  on  a  neighborhood  U  of  the  origin  of  H.  It  is  said  to  be  a 
Lyapunov  functional  for  Eq.  (9)  if 

(i)  C{v)  <  0  for  any  v  eV  r\U  with  u  ^  0,  (13) 
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(ii)  $  is  uniformly  positive- definite  so  that  4(0)  =  0  and 

inf  $(v)  =  >  0  Vr  >  0.  (14) 

IMI>r  ' 

IV.  STOCHASTIC  STABILITY 

There  are  three  types  of  stochastic  stability:  stability  in  probability,  the  p-th  moment 
stability  and  the  almost  sure  (a.s.)  stability.  In  what  follows,  we  will  consider  the  first  type 
of  stability  and  the  latter  two  in  the  asymptotic  sense  as  <  — »  oo. 

We  say  that  the  null  solution  of  Eq.  (9)  is  stable  in  probability  if  its  solution  uf  satisfies 

lim  P{sup  ||uf  (I  >  e}  =  0  for  any  £  >  0.  (15) 

IMI-^  t>o 

The  null  solution  is  asymptotically  stable  in  p-th  moment  if  for  any  9?  6  C/,  we  have 

Um£||<r  =  0,  p>0,  (16) 

*•”*00 

and  it  is  a.s.  asymptotically  stable  if,  for  any  (p  £  U, 

Pl^liin  IKII  =  0}  =  1.  (17) 

The  three  types  of  stability  as  defined  by  (15)-(17)  can  be  discussed  with  the  aid  of 
a  Lyapunov  functional.  First  of  all  let  us  assume  the  existence  of  such  a  functional  $  in 
U  €  H.  We  shall  summarize  the  facts  about  the  stability  criteria  as  three  theorems. 
Theorem  1.  The  existence  of  a  Lyapunov  functional  $  in  t/  C  H  implies  that  the  null  solution 
is  stable  in  probability.  ^ 

The  proof  is  a  simple  consequence  of  the  following  Chebyshev  inequality: 

P{  sup  lluHl  >r}< 
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In  the  next  theorem,  we  give  some  sufficient  conditions  for  the  p- moment  stability. 
Theorem  2.  Suppose  that  there  exists  a  Lyapunov  functional  in  H  such  that 


(i)  Cill^ll"  <  ^h)  <  C2\\hr,  for  heH,  (18) 

(ii)  £$(v)  <  —0311011",  for  o  €  V,  v  ^  0,  (19) 

where  ci,C2,C3  are  positive  constants. 

Then  the  null  solution  is  asymptotically  stable  in  p-th-moment.  ^ 

The  proof,  based  on  the  Ito  formula  and  the  Gronwall  inequality,  is  rather  straightforward. 
For  the  finite-dimensional  case,  such  a  theorem  is  given  by  Khasminskii  (p.l86,  [2]).  Actually, 
in  this  case,  the  null  solution  is  exponentially  stable  in  p-th-moment.  The  last  theorem  in 
concerned  with  a.s.  asymptotic  stability.  For  this  to  be  true,  intuitively,  two  things  must 
happen:  Starting  from  any  point  in  H,  the  trajectory  uf  will  be  trapped  in  a  ball  Br  of 
radius  r  after  some  finite  time,  and,  once  inside  Bn  the  path  will  eventually  find  its  way  to 
the  origin  with  probability  one.  The  sufficient  conditions  are  given  in  the  following. 
Theorem  3.  Let  there  exist  a  Lyapunov  functioned  ^  in  H  such  that 

(i)  £$(«)  <  -c$(v),  (20) 

for  some  c  >  0  and  v  €V,  with  v  0. 

(ii)  sup{||g'/25»Z)$(t;)||/$(t;)}  <  M,  (21) 

vev 

where  M  >  0  is  a  contant. 

Then  the  null  solutionis  a.s.  asymptotically  stable, 


281 


The  proof  of  this  theorem  is  more  complicated  technically,  and  it  can  be  carried  out  as 
in  Thm.4.1  of  our  paper  [3]  with  a  minor  modification.  The  conditions  (i)  and  (ii)  ensure 
that  the  two  things  mentioned  above  would  indeed  happen  and  the  theorem  follows. 

V.  SOME  EXAMPLES. 

For  the  purpose  of  illustration,  let  us  consider  a  few  examples. 

Ex.l).  Consider  the  linear  SPDE: 

=  +  t>0,  0<x<l,  (22) 

subject  to  the  initial- boundary  conditions: 

,  u(0,x)  =  v(x), 

I  u(<,0)  =  u(<,l)=0,  (23) 

where  i/  is  a  positive  constant.  In  this  case,  we  let  JI  =  K  =  L^(0, 1)  and  let  Hq  be  the 
Sobolev  space  as  in  §II.>It  was  shown  by  Khasminskii  and  Mandrekar  [5]  that  the  functional 

*(»)  =  jT  EKWl*  (24) 

is  a  Lyapunov  functional,  where 

Ml = ib.ip + iiv’ii^  =  Ab'(^)i^ + Mx)ndx. 

Jo 

Therefore,  by  Thm.l,  the  null  solution  is  stable  in  probability.  In  fact  the  Lyapunov  func¬ 
tional  (24)  satisfies  the  conditions  (18)  and  (19)  of  Thm.2  for  p  —  2.  Thus  the  null  solution  is 
asymptotically  stable  in  second  moment  or  in  mean-squ^^^e.  Moreover,  if  the  covariance  func¬ 
tion  q{x,y)  of  the  Q- Wiener  process  W{t,x)  is  bounded  and  continuous,  then  the  conditons 
(20)  and  (21)  for  Thm.3  are  met.  Hence,  in  this  case  the  null  solution  is  a.s.  asymptotically 
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stable. 


Ex.2).  Instead  of  Eq.  (22),  we  consider  the  following  nonlinear  equation: 


du(x,t)  d^u  f  u  \ ^  ^ 


<  a:  <  1, 


(25) 


with  the  saume  initial- boundary  conditions  (23). 
Let 


$(u)  =  \\v\\\  (26) 

Referring  to  Eq.  (9),  we  have  A  =  and  B[y)  =  Then  it  is  easy  to  get 

£$(u)  =  2  <  Av,v  > -{■Tt.[B{v)QB*{v)\ 

=  ,(x, 

<  —2v  f  \vx(x)\^dx  +  f  g(x,  x)v^(x)dx.  (27) 

Jo  Jo 

Therefore,  if 

/  q(x,x)v^{x)dx  <2i/  f  fv^^dx, 

Jo  Jo 

we  have 

£$(«)  <0  for  u  e  Hq, 

so  that  $  given  by  (26)  is  a  Lyapunov  functional  and  the  null  solution  is  stable  in  probability. 
Suppose  that  q  is  bounded  and  continuous  with 

qo  =  max  g(x,x). 


Then  (27)  yields 


£$(«)<  (~2Ao  +  9o)$(t;). 

If  the  following  condition  holds, 

qo  <  2Ao, 

then  it  is  not  difficult  to  verify  tht  the  conditons  for  both  Thni.2  and  Thm.3  are  fulfilled. 
We  can  conclude  that  the  null  solution  is  as5nmptotically  stable  in  mean-square  as  well  as 
with  probability  one. 

Ex.3).  Let  D  C  be  a  bounded  domain  with  a  smooth  boundary  dD.  We  consider  the 
the  reaction-diffusion  equation  with  a  random  perturbation: 

=  i/Au /(it)  4- W(<,x)|Vu|,  f  >  0,  X  e  D, 

=  v(u),  (28) 

=  0, 


du(t,x) 
~  dt 

“  it(0,  x) 

t 


where  /  is  a  locally  Lipschitz  continuous  function  with  at  most  a  polynomial  growth  and 
/(O)  =  0.  Consider  the  scalar  ordinary  differential  equation: 


which  is  assumed  to  possess  a  Lyapunov  function  (p{r)  so  that  <p"{r)  <  0,  <fi{r)f{r)  >  0  and 
(p{r)  ^  oo  as  |r|  — »  oo. 

Let  us  introduce  H  =  K  =  L^{D),  =  {v  e  H  :  |Vt;|  €  H  and  v\dD  =  0}  and 

V  =  L^{D)  n  Hq  for  some  p>2.  We  define 


$(t;)  =  J  {p[v{x)]dx. 


(29) 
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The  following  calculations  are  straightforward: 


£$(v)  =z  —  u  f  ^"(t;)lVv|*dx  +  f  (p{v)f{v)dx 
Jd  Jd 

V>”(w)9(ac.*)|Vvpdx 

If  q  is  bounded  and  continuous  with 


go  =  sup  g(x,  x)  <  2i/, 
*€Z> 


then 

£$(«)  <0  for  V  €V 

so  that  $  defined  by  (29)  is  a  Lyapunov  functional  and,  by  Thm.l,  the  null  solution  is  stable 
in  probability.  But,  for  this  example,  the  asymptotic  stability  results  are  difficdR  to  get  with 
making  further  assumptions. 
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Abstract 

Preliminary  results  on  the  analysis  and  computation  of  approximate  solutions  to  a  simple 
mathematical  model  for  high  strain  rate  shear  deformations  of  a  thermo-plastic  material  in  one 
and  two  dimensions  are  discussed.  This  problem  involves  a  system  of  time-dependent  partial 
differential  equations.  Open  questions  on  the  qualitative  behavior  of  the  solutions  are  given 
and  the  results  of  numerical  calculations  that  address  these  issues  are  presented.  Several  time 
discretization  procedures  that  retain  certain  fundsunentzd  properties  of  the  system  of  partial 
differential  equations  are  described.  Finally,  an  optimal  order  error  estimate  for  a  semidiscrete 
finite  element  method  is  furnished. 

Presented  at  the  Eleventh  Army  Conference  on  Applied  Mathematics  and  Computing,  Caxnegie 
Mellon  University,  Pittsburgh,  PA,  8-10  June  1993. 
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1  The  Shear  Band  Model: 

Consider  high  strain  rate  shear  deformations  of  a  thermo-plastic  material  occupying  a  region  fi  x 
(—00,00)  where  fi  =  (0,1)  for  the  one- dimensional  problem  and  is  a  bounded  domain  in  the  xy 

•PartiaUy  funded  by  the  Army  Research  Office  through  grant  28535-MA 
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plane  for  the  two-dimensional  problem.  We  assume  the  deformations  are  of  antiplane  sheax  type  in 
the  two-dimensional  case  and  the  applied  forces  and  boundary  conditions  depend  only  on  z  €  SI. 
Thus,  the  quantities  we  seek  will  also  depend  only  on  x  6  fi  and  time,  t.  The  model  we  describe  is 
based  on  the  assumption  that  thermal  softening  is  due  to  the  heat  energy  generated  by  the  plastic 
work.  We  neglect  strain  hardening  and  focus  on  the  strain  rate  effects. 

The  mathematical  problem  consists  of  an  equation  for  balance  of  momentum 

(1)  V  — V*<7  =  0  on  SI,  <>0 


and  balance  of  heat  energy 

(2)  6  —  XA8  =  K(T  •  Vt>  on  ft,  t  >  0 

where  i  denotes  the  time  derivative  of  z,  for  simplicity  we  set  density  />  =  1, 


or  V  = 


*dx'^^dy' 


X  is  the  thermal  conductivity,  k  gives  the  conversion  of  mechanical  energy  to  heat,  v  =  v{x,t)  is 
the  displacement  velocity,  a  =  <T(x,t)  is  the  stress,  amd  9  =  9(x,t)  is  the  temperature.  For  the 
constitutive  law  we  take 

(3)  <r  =  ^(9)Vv 

where  we  consider  fi(0)  =  for  constant  a  >  0  and  fi(9)  =  9‘'  for  constant »/  <  0.  To  complete 
the  specification  of  the  initial/boundary  value  problem  we  have  conditions 


(4) 


<r  •  n  =  (Ti  •  n  on  F,  and  v  =  vi  on  r„ 


where  5ft  =  F,  U  r„  as  well  as 

d9 

(5)  0  =  0  on  Fo  and  ^  ^ 

where  5ft  =  Fd  U  F^v-  Here  n  is  the  outward  pointing  unit  normal  to  ft,  ai  and  Vi  are  given 
functions.  We  also  need  initial  conditions 


(6)  V  =  Vo  and  0  =  0o  on  t  =  0. 

Our  goal  in  this  work  is  to  understand  the  mathematical  mechanisms  that  lead  to  shear  bands. 
We  will  study  the  qualitative  aspects  of  this  time  dependent  model  through  scientific  computations. 
An  important  part  of  this  research  is  to  justify  the  numerical  algorithms  as  much  as  is  possible. 

The  literature  on  these  problems  is  extensive.  We  note  [B]  for  scientific  computations  on  an 
elastic  plastic  multi-dimensional  model,  [DF]  for  computations  with  a  moving  spatial  mesh  with 
variable  time  steps  in  one  dimension,  [MM]  for  a  characterization  of  steady  state  solutions,  [T] 
for  analysis  of  “blowup”,  and  [W]  for  scientific  computations  on  a  one-dimensional  elastic-plastic 
model. 
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2  Qualitative  Behavior  of  Solutions: 


We  describe  briefly  several  analytic  results  for  the  pairtial  differential  equation  in  this  section. 

Maddocks  and  Malek-Madani  [MM]  give  a  functional  which  would  be  Lyapunov  if  it  was  known 
that  the  problem  (l)-(6)  had  a  unique  global  solution.  We  present  the  short  derivation  of  this 
property  in  the  one-dimensional  case  with  fi{0)  =  e~^^,  k  =  A  =  1,  and  Dirichlet  boundary 


conditions 

(7) 

«(0, ()  =  «(!, ()  =  »(0.‘)  =  »(1.1)  = 

Let 

and  note 

(8) 

^  =  -0^x9  -1-  aV: 

Multiplying  equation  (1)  by  u  and  (2)  by  6,  adding  the  resulting  equations,  integrating  with  respect 
to  X,  and  using  integration  by  parts  we  have 

+  e^)dx  = 

From  (8)  and  the  boundary  conditions  we  obtain 

(9)  =  + 

where  ^ 

/(«,».)  =  j[  + 

Tzavaras  [T]  proves  there  will  be  no  global  solution  in  the  adiabatic  (A  =  0)  case.  We  now  give 
the  short  “blowup”  argument  from  [Tj.  We  take  fi(9)  =  =  1  for  t  >  0,  and  ^(i,0)  =  1 

for  X  €  fi.  Let  U{9)  =  —9~^  and  note  U'  =  /r.  From  (2)  with  k  =  1  we  have 

9  =  <TVx- 


Multiplying  by  fi{9)  we  have 

Integrating  from  0  to  T,  setting  x  =  1,  and  using  the  boundary  conditions  on  a  yields 

U{9{\,T))  =  V{9{\M^T. 
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From  the  initial  condition  and  definition  of  U  we  have 

Thus  the  temperature  will  “blowup”  as  T  -+  1.  A  natural  question  is:  What  happens  when  A  >  0, 
the  non-adiabatic  case? 


3  Numerical  Methods: 

In  this  section  we  describe  several  numerical  methods  which  have  a  finite  element  spatial  discretiza¬ 
tion  combined  with  a  finite  difference  time-step  scheme. 

Let  Mix  be  a  space  of  continuous  piecewise  polynomials  of  degree  <9-1  defined  on  a  “trian¬ 
gulation”  of  SI.  These  elements  have  diameter  =  h. 

For  simplicity  we  assume  we  are  approximating  problem  (l)-(6)  with  homogeneous  Dirichlet 
boundary  conditions  on  v  and  d.  A  spatially  discrete  continuous  in  time  finite  element  method  is: 
find  ivii{'xt),0ix{-,t))  £  Mh  x  Af/,  such  that 

(w/iiX)  +  =  0  Vx  e  Mh, 

{h,  <p)  +  H'^6h,  V<p)  =  «(m(0a)I Vt;A|^  <p)  V<p  G  Mh 
for  0  <  t  <  T  where  Vh(’,0)  ^  vq  and  ^^(•,0)  =  Oq.  In  this  method 


iw,z)=  I 

Ja 


wz  dA. 


In  [FG]  it  is  shown  under  the  assumption  problem  (l)-(6)  has  a  unique  smooth  solution  on  an 
interval  [0,  T]  that 

-»/*(•»  Oil  <  Ch’f 


<  Chi 

where  C  is  a  constant  depending  on  derivatives  of  6  and  v  but  not  on  h.  Here  ||z|p  =  /q  z'^dA.  These 
rates  are  “optimal”  in  the  sense  that  they  are  the  best  one  could  expect  from  the  approximation 
space  used. 

We  now  turn  to  the  time  discretization.  Define  div^  :  {L^{Sl)Y  — ^  Mh  where  d  =  1  on  2  by 

(div/i77,x)  =  -(9.  Vx)  Vx  G  Mh, 

Ah  :  Mh  Mh  by 
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(A/i^,x)  =  -(Vf.Vx)  Vx€Ma, 

and  let  ith  be  the  Z^-projection.  With  this  notation  the  backward  Euler  method  on  the  nth  time 
step  is 

(10)  -  <iivi(M»S)v»;+')  =  0 

(11)  -  »);)  -  =  n  (;<(«ll)|V<,rP) . 

Here  fc  >  0  is  the  time  step. 

The  scheme  is  fully  implicit  if^=:m  =  n  +  l.  We  used  this  scheme  in  two  dimensions  with 
a  fixed  point  iteration  to  solve  the  nonlinear  systems  and  preconditioned  conjugate  gradients  to 
solve  the  linear  systems.  The  scheme  is  called  semi  implicit  if  ^  =  m  =  n.  We  used  this  scheme  in 
one-dimension  with  a  tridiagonal  solver  for  the  linear  systems. 

Many  researchers  have  investigated  time  discretization  schemes  that  preserve  a  discrete  energy 
or  Lyapunov  functional  (see,  for  instance,  [DN],  [E],  [G],  [SS]  or  [FS]).  In  one  dimension  with  the  list 
of  assumptions  used  to  derive  inequality  (9)  the  backward  Euler  scheme  with  £  =  n  +  1  and  m  =  n 
retains  a  discrete  version  of  this  inequality.  To  see  this  multiply  the  first  equation  by  -  vJJ 
and  integrate  with  respect  to  i.  After  integration-by-parts  noting  •)  =  .)  =  0  we  have 

|l„n+l  _  t,n||2  ^  k  -  <x)  =  0 


or 


(12)  ||»r'  -  v);ii=  + 1 

+1  ("K'  -  <4  ’  ("S’ "  ’’I.*))  =  “■ 


Labeling  the  term  inside  the  [•]  as  7  we  have 


J  =  -  ((e 


-20?'*'^  ^-20?)  „n 


e  ‘■'h 


Applying  Taylor’s  Theorem  we  have 

+  (-2e-«”‘)  i  -  «;+■)’ 


SO 
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Substituting  this  back  in  (12)  yields 


+  |[( 


+k  (e-"”'  («;»'  -  9;) 

+k  (e-^'  (sr' 

^-(e-«r  (.K' ^l)  =  o 


Multiplying  (11)  by  -  djj)  we  obtain 

W'  -  «kll"  +  I  (Ki'll'  -  =  *  ( 


Adding  this  to  (13)  gives  a  discrete  Lyapunov  functional  equation  for  the  backward  Euler  method, 

+  P  =  Arf;" 


where 


=  f[( 


£“  =  i||9U’  +  ^('''"'(<j“.i) 

(<:'  -  oi;,)' .  1))  +  + k  (<,)' ,  1) . 


The  following  scheme  also  retains  a  discrete  version  of  (9): 


*  \  / 


!(«;+>  -  «;)  -  +«;)  =  -»*  ^ 


rK*'.<'S^)-EW,vSJ 


K*'  -  K 


Multiplying  the  first  equation  by  A:~^(t;^'''^  -  v^)  and  the  second  by  -  9^)  gives  the 

following  after  adding  the  two  equations  and  integrating  with  respect  to  i: 

=  -k  (ii<.--’  (-.r  ‘  -  <)  IP + 11*“'  («r‘  -  n)  II") 

Finally,  we  note  that  higher  order  accurate  schemes  can  be  created  using  the  finite  element  in  time 
techniques  described  in  [FS]. 
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4  Numerical  Experiments: 

In  this  section  we  show  the  results  of  some  of  our  preliminary  computations.  More  experiments  are 
planned  and  we  intend  to  use  more  sophisticated  adaptive  methods. 

Figure  1  shows  the  results  of  a  computation  in  the  adiabatic  case  (A  =  0).  As  discussed  earlier 
[T]  has  shown  that  the  temperature,  6,  will  “blowup”  at  the  x  =  1  boundau-y  as  t  —  1.  VVe  take 

^{6)  -  ax  =  1,  vo(x)  =  ^x^,  and  ffo(x)  =  1 

A  refined  mesh  and  variable  time  step  strategy  were  used.  The  top  graph  has  a  snapshot  of  0/i 
at  T  =  0.9.  The  bottom  graph  has  the  evolution  of  at  i  =  1  plotted  against  its  known  values 
(l-t)-K 

Figure  2  has  the  results  from  nearly  the  same  calculation  except  we  took  A  =  0.1,  added 
Neumann  boundary  conditions  for  0,  and  stopped  at  T  =  1.25.  In  this  case  it  can  be  shown  that 

d(l,t)=  \ 

So,  to  estimate  the  behavior  of  ^/i(l,  t)  we  used  a  centered  difference  approximation  of  ^0k,xjc”-  These 
are  the  values  on  the  dashed  line  in  the  bottom  graph.  Certainly  from  this  numerical  evidence  it 
appears  that  the  solution  will  “blowup”. 

Finally,  in  figure  3  we  explore  “blowup”  in  two  dimensions.  We  have  the  adiabatic  case  (A  =  0), 
fi(0)  =  0~^,  VQ{x,y)  =  |(1  -  x^)y^,  and  0oix,y)  =  1.  The  domain  is  =  (-1,1)  x  (0,1).  We  took 
V  =  0  and  0  =  1  on  all  boundaries  except  the  segment  of  the  line  y  s  1.  On  that  boundary  we  had 
=  0  and  (7  •  n  =  (1  -  x^).  We  used  a  40  x  40  refined  mesh  and  a  variable  time  step  with  400 
steps.  Again  we  see  a  possible  blowup  point  starting  to  form. 
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Figure  3 


5  Specialized  Finite  Difference  Method: 


We  finish  this  report  by  describing  a  finite  difference  scheme  for  the  adiabatic  problem  that  has  the 
same  “blowup”  as  described  earlier  from  [T].  The  scheme  will  give  approximations  to 

(14)  V  =  <Tx  on  (0, 1),  t  >  0, 

(15)  e  =  n{d)vl  on  (0,1),  t>0. 


with  boundary  conditions 

‘^(0,t)  =  0,  a(l,t)  =  l,  t>0 

where  a  =  and  initial  conditions  are  provided  for  9  and  v. 

Let 

92  ^9(£h,nk) 


^?+i/2  -  v((e  +  l/2)h,nk), 
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n+1/2  _  „n+l/2 
”<+1/2  ^<-1/2 


where 


=  ii  -^-^----  -- 

”<"+//?  =  ^  (”<"h/2  +  ”"+1/2)  ■ 


The  scheme  for  (14-15)  is  then 

^  (<1%  -  -Jh/O  =  I  «.■"  -  ,  < = 0, 1, . . . .  i  - 1, 

1  («?*'  -«?)=/!((»?+', «?)  (/.-■  (.?«//  -  Cv?))' ,  /  =  0, 1, . . . ,  i 

and 

n+1/2  _  ^  -n+1/2  _  , 

<Tq  —  U,  (Tf^  —  1. 

We  now  show  that  the  approximation,  will  “blowup”.  Take  /i(v?)  =  so  U(<p)  = 
and  set  ^  =  1  for  /  =  0, 1, . . . ,  X.  Multiplying  the  second  equation  by  we  have 

M  -  ^?) = *  • 

At  /  =  X  we  have  =  1  so  this  becomes 


Iterating  this  formula  we  have 


{?((?”+>) -P(IIJ)=I:. 


(O'*  =  («S)‘'  -  f'* 


l-JVt 


which  proves  the  “blowup”. 
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A  System  for  Materials  of  Korteweg  Type 
in  Two  Space  Variables 

Harumi  Hattori*aiid  Dening  L*d 
Department  of  Mathematics 
West  Virginia  University 


1  Viscous  Isothermal  Motion  of  Korteweg  Type  Materials. 

We  consider  a  simplified  isothermal  motion  of  the  Korteweg  type  materials  where  the 
viscous  effect  is  'included.  The  general  model  for  the  Korteweg  type  materials  was  pro¬ 
posed  by  Dunn  and  Serrin  [1]  with  the  interstitial  working  term  and  they  modified  the 
system  of  compressible  fluids  based  on  the  Korteweg  theory  of  capillarity.  We  will  restrict 
ourselves  to  the  special  isothermal  case. 

With  the  Helmholtz  free  energy  chosen  as 

^  =  F{p)  +  Yp^Pl  +  P%  (1-U 

where  F  is  a  smooth  function  of  p  and  is  a  positive  constant,  we  obtain  the  following 
system  for 

Pt  +  (/w).  +  {pv)y  =  0, 

{pu)t  +  {pv?)t  +  {(mv)y  =  -p,  +  vpAp,  -I-  pAti  +  f  (u,,  -I-  Uxv),  (1.2) 

(pv)t  +  (PUV)*  +  {PV\  =  -Py  +  VpApy  +  PAV  +  f{Ury  +  Vyy). 

with  the  initial  data 


(p,  u,  u)(x,  y,  0)  =  (po,  uo,  t;o)(a:,  y).  (1-3) 

*The  work  was  8upp<Hrted  in  part  by  Army  Grant  DAAL  03-89-G-0088 
^The  work  was  suppcwted  in  part  by  ONR  under  Grant  N00014-91-J-1291. 
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Where  p'  >  0,  p*  >  0: 


jfio  p 

(1.4) 

tf(po)  =  0,  Po  =  const  >  0. 

(1.5) 

Hip)  >  6{p  -  po)\ 

(1.6) 

2  Local  Solution 

First  we  discuss  the  local  existence  of  solutions  for  the  initial  value  problem  of  (1.2)(1.3). 
Denote  by  ||  •  ||  ~  ||  •  ||o  the  norm  and  by  ||  •  ||;t  the  k-th.  order  Sobolev  norm.  Set 

|u.|iT  =  (iwoir  +  ||Vp(()f )  +  [  (llVu(l)ll'  +  l|Vt.(<)f )  dt  (2.1) 

and 

l>»lt  =  E  1^.0.“'!'.  •  (2-2) 

Ul<» 

where  w  s  (p,  u,  u).  Then  the  main  result  of  the  local  existence  is  the  following  theorem 

Theorem  1  Assume  the  initial  data  (1.3)  satisfy 

(po  ~  Pb»  Vo)  €  Po  ^  ^  >  0,  (2.3) 

where  k>i  and  pb  >  0  is  a  positive  constant. 

Then,  3T  >  0  such  that  Cauchy  problem  (1.2)(1.3)  has  a  unique  solution  U7  = 
(p  —  po,  u,  u)  in  (0,  T\  such  that 

1.  p-p-o€i«>([0,r];/f*+H«*)), 

2.  («,»)€  ' 

3. 

IwK  <  C,  (iKollJ  +  11/^,111+,)  . 

Where 

|t»|J=  sup  (iKt)ll' +  ||Vp(l)f) 

0<t<T  ' 

+  jf’'(||Vu(()r  +  l|Vu(()||*)*,  (2.4) 

lu.fi  =  Y. 

Ul<* 
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Tke  theorem  is  proved  by  estabUshmg  the  energy  estimate  for  the  linearized  problem 
and  linear  iteration. 

Since  the  linearized  problem  of  (1.2)  is  not  of  any  classical  type,  the  existence  of  so¬ 
lutions  is  not  known  even  for  the  linearized  problem.  We  prove  the  existence  of  solutions 
for  the  linearized  problem  by  establishing  an  energy  estimate  for  the  dual  problem  and 
then  using  the  dual  argument. 

The  linearization  of  the  problem  (1.2)(1.3)  is  the  following 

(dt  +  ud*  -I-  vdy)p  -I-  p(ur  +  Vy)  =  /i, 
p(dt  +  uds  -J-  vdy)u  +  p'(p)px 

'  -upApx  -  /lAu  -  f  (u*,  -I-  v^)  =  /a, 

p{dt  +  udx  +  vdy)v  +  i/(p)py 

-vpApy  -  pAv  -  f  (u^  +  Vyy)  =  /a. 

(^ii,t>)(x,y,0)  =  {^,uo,vo)ix,y). 

For  the  linear  problem  (2.5)(2.6),  we  have  the  estimate 

adl-illl  +  ll^llUl)  +  +  II*IIm. 

<  c,  (iiti.iii + mUi + ii/iii + ii/iiiwi) 

l<i«<C»(T)(|K||J  +  ||A,||J+.) 

The  estimate  (2.7)  is  derived  by  integrating  by  parts  the  inner  products  of  the  last  two 
components  of  (2.5)  with  (u,  u),  and  using  repeatedly  the  conservation  of  mass  equation 
to  treat  the  term  p(u*  + 1;„)  coming  from  the  non-symmetry  of  the  system. 

For  the  existence  of  solutions  for  linear  problem  (2.5)(2.6),  we  need  to  consider  the 
dual  problem 

=  -dt^  -  -  B;dy^ + (iT  +  =  g,  (2.8) 

^(x,y,r)  =  0.  (2.9) 

System  (2.8)  is  non-symmetric  with  the  principal  part: 

dt^  —  A(dx^2  +  dy^)  =  Pi, 

'  -l-  dx^i  +  A^2  +  dxy^  —  521  (2.10) 

,  +  dy^2  +  A^  +  dxy^  =  53j 


(2.5) 

(2.6) 

-  (2.7) 
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(2.11) 


For  the  system  (2.8)(2.9),  we  have  the  dual  estimate: 

IWOf  +  II*3«)II3  <  C  (||j(()ll’  +  jf'  l|j(r)||Vr) 

and  the  negative  order  estimate: 

IIA'^OII’  +  l|A‘+’^4(()r  <  C  (llA'j(t)r  +  [  l|A‘j(T)||*</r  j  .  (2.12) 

where  a  £  R  and  A  is  the  operator  with  symbol 

Mf.'?)  =  \/l  +  lfl'  +  W-  (213) 

The  estimate  (2.12)  is  obtidned  by  integrating  by  parts  the  inner  product  of  2nd  and  3rd 
equations  with  (^,  ^)  and  (A^,  A^).  The  negative  norm  estimate  (2.13)  is  derived 
similarly  by  applying  the  operator  A*  to  the  system.  Once  we  have  the  dual  estimate,  the 
existence  of  solutions  for  (2.5)(2.6)  can  be  established  by  the  standard  dual  argument. 

From  the  existence  of  solutions  for  the  linearized  problem  (2.5)(2.6)  and  their  estimate 
(2.7),  the  solution  for  the  nonlinear  problem  (1.2)(1.3)  is  obtained  by  linear  iteration. 
First,  we  construct  approximate  solution  t2;(z,y,t)  such  that: 

dtw  -  Aw  —  0,  t5(a:,  y,  0)  =  tuo(x,  y)  (2.14) 


so  that  the  Cauchy  problem  (1.2)(1.3)  is  transformed  into  a  problem  with  homogeneous 
initial  data: 

mw = -£(tfi).s = /.  , 

u>(x,y,0)  =  0. 

Take  tuo  =  0  to  begin  the  iteration  scheme.  The  successive  Wj+i{j  =  0, 1,---)  is 
decided  by  solving  the  following  problems 


I 


L{wj)wj^i  =  /, 
w'j+i(®>y,o)  =  0. 


(2.16) 


The  existence  of  the  iteration  sequence  in  a  common  interval  is  derived  from  the 
fact  that  the  norm  J^WjUH  is  uniformly  bounded.  And  the  convergence  of  the  iteration 
sequence  is  guaranteed  by  choosing  7*  <  1  such  that 


Iwill «  1,  (2.17) 

|iiiy  -  <  i  Itiij-i  -  (2.18) 

Consequently,  the  limit  of  the  iteration  sequence  is  the  desired  solution.  The  uniqueness 
of  the  solution  is  derived  readily  from  the  energy  estimate. 
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3  Classical  Global  Solution 

The  existence  of  global  solution  for  (1.2)(1.3)  can  be  obtained  from  the  local  existence 
theorem  in  section  2  by  using  the  Matsumura-Nishida  technique  in  [3].  We  have  the 
following 

Theorem  2  Assume  that 

(po  —  ^  ^  >  0.  (3-1) 

ll*^oll4  +  ||/>o  -  A)||5  ^  (3*2) 

Then,  for  eo  1,  there  exists  a  unique  solution  (p,u,v)  in  [P,oo)  such  that 

p-po€  L~([0,oo);ff»(fl’)),  (u,v)  €  L^dO^oo);  (3.3) 

and  satisfying 

I«>K<C{IKIIJ+Il«.-*I|0  (3.4) 

with  tu  s  (p  —  po«ti,v). 

The  theorem  is  proved  by  applying  the  following  lemma  similar  to  the  one  in  [3]. 
Lemma  1  Let  u;  be  a  solution  such  that 

sup  (||u;(0l|4  +  ||p(0  -  ^lls)  <  c.  (3.5) 

o<t<r 

If  e  is  small,  then 

sop  (IWOliJ  +  l|p(<)  -  A>lll)  <  C.(KII5  +  IlPo  -  PbllS).  (3.6) 

0<«<T 

Here  C*  is  independent  of  T. 

The  inequality  (3.6)  can  be  derived  by  looking  carefully  the  derivation  of  the  in¬ 
equality  (2.7).  We  will  need  the  assumption  that  p'  >  0,  p"  >  0  and  (1.4)(1.6),  as 
well  as  the  fact  that  the  "error”  terms  in  the  linearization  are  of  quadratic  order.  Once 
(3.6)  is  established,  the  existence  of  the  global  solution  follows  readily  from  the  standard 
continuation  argument. 
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Abstract 

The  addition  of  a  few  parts  per  miOion  (by  weight)  of  certain  long-chained  polymer  molecules 
dramatically  reduces  the  drag  of  a  turbulent  Sow,  often  by  a  factor  of  about  3  or  4.  This 
phenomenon  is  known  as  Toms  effect.  It  has  been  tested  and  expiated  in  applications  ranging 
from  oil  pipelines  to  n<^  reduction  of  submarines,  yet  little  is  understood  about  the  physics  of 
this  phenomenon.  It  is  generally  surmised  that  turbulent  flows  *somehow*’  stretch  the  polymers, 
thereby  increasing  the  viscosity  locally,  thickening  the  viscous  suUayer,  and  thus  reducing  the 
velocity  gradient  at  the  boundary.  What  is  lackiag  is  the  mechanism  by  which  large  scale 
turbulent  structures  can  stretch  individual  pdymer  molecules,  a  proUem  spanning  several  orders 
of  magnitude. 

In  this  paper,  we  adopt  a  hybrid  eOipodd-dumbbeD  modd  to  modd  the  dynamics  of  a 
polymer  molecule.  We  embed  these  modd  molecules  within  a  numerkd  simnlation  of  a  single 
streamwise  vortex  in  a  shear  flow.  The  effect  of  the  vortex  is  to  rotate  the  flow  gradient  within 
the  vortex  core,  thereby  establishing  inflectional  vdodty  profiles;  we  also  find  that  it  establishes 
regions  of  positive  streamwise  strain  rate  on  the  downflow  side  of  the  vortex.  These  regbns  are 
observed  to  be  of  sufficient  strength  to  stretch  the  pdymer  molecnle  to  experimentally  observed 
elongations.  Regions  of  negative  strain  rate  on  the  upflow  side  of  the  vortex  are  observed  to 
allow  the  molecules  to  rdax,  and  admit  the  possibility  of  entanglement.  We  estimate  the  local 
increase  in  viscosity  due  to  polymer  deformation,  and  use  this  to  infer  the  local  decrease  in 
strength  of  the  associated  vortex,  the  reduced  inflectional  velodty  profiles  within  the  vortex 
core,  and  the  increased  stability  of  the  flow  locally. 
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1  Introduction 

It  is  widely  suspected  that  turbulence  suppression  and  turbulent  drag  reduction  by  dilute  polymer 
additives  occur  b^ause  the  polymers  dissipate  energy  via  stretching.  However,  previous  investiga¬ 
tions  of  the  dynamics  of  polymers  in  uniform  shear  flow  have  shown  that  the  embedded  polymers 
undergo  too  little  deformation  to  dissipate  any  significant  amount  of  energy.  In  this  paper,  we  iden¬ 
tify  a  physical  mechanism  in  a  single  stream  wise  vortex  flow  that  is  capable  of  substantially  deforming 
polymers,  thereby  dissipating  enough  energy  to  help  stabilize  the  flow  and  reduce  drag.  We  identify 
this  mechanism  by  modeling  the  dynamics  of  a  polymer  molecule  which  has  been  embedded  in  a 
streamwise  dependent  vortical  flow.  We  investigate  how  the  flow  establishes  regions  with  positive 
and  negative  streamwise  strain  rates  of  sufiScient  strength  to  stretch  the  polymer  to  experimentally 
observed  deformations,  and  then  to  permit  relaxation,  possibly  with  subsequent  entanglement.  We 
also  discuss  the  resulting  predictions  of  the  single  vortex  model;  the  increase  in  viscosity  obtained 
from  energy  dissipation  about  the  stretched  polymer  decreases  the  vortex  strength  locally  near 
those  vortices  which  have  activated  polymers,  thus  reducing  the  inflectional  profiles  at  the  vortex 
cores,  and  so  preventing  those  vortices  from  undergoing  transition  to  turbulence. 

2  Modeling  the  Dynamics  of  a  Single  Polymer  Molecule 

We  first  present  a  model  for  the  dynamics  of  a  single  polymer  molecule  in  a  3-D  flow  field.  In 
a  previous  work,  Keyes  and  Abernathy  (1987)  successfully  employed  a  hybrid  ellipsoid-dumbbell 
polymer  model  in  uniform  shear  flow  to  explain  observed  “early  turbulence”  in  dilute  polymer 
shear  flows  as  being  polymer-induced  velocity  fluctuations,  with  decreased  frequency  in  regions  of 
increased  shear.  In  this  model,  the  polymer  is  viewed  as  simultaneously  having  a  dual  nature.  When 
considering  deformation^  effects,  the  polymer  is  viewed  as  a  bead-and-spring  dumbbell;  specifically, 
as  two  spheres  of  equal  radius  connected  by  a  nonbendable  spring  that  does  not  interact  with  the 
surrounding  flow.  When  considering  rotational  effects,  the  polymer  is  viewed  as  a  rigid  prolate 
ellipsoid  whose  interior  consists  nf  the  polymer  and  the  accompanying  entrained  fluid  (Figure  1). 
This  dual-natured  model  derives  from  the  following  observations.  Self-avoiding  random  walk  studies 
show  that  the  3-D  conformation  of  a  random  coil  polymer  is  contained  within  an  envelope  with  a 
shape  similar  to  ^hat  of  a  rounded  bar  of  soap.  The  characteristic  lengths  of  this  envelope  occur 
roughly  in  the  ratio  3 : 1.5 : 1  (Rubin  and  Mazur,  1975).  In  a  uniform  shear  flow,  polymers  exhibit  a 
single  response  time.  In  low  strain  rate  flows,  the  polymers  appear  to  undergo  motions  as  though  they 
were  rigid  objects.  Frequency  measurements  indicate  that  this  motion  is  similar  to  the  convection 
and  flipping  of  solid  ellipsoids.  In  higher  strain  rate  flows,  the  flipping  frequency  changes  in  such  a 
way  as  to  indicate  that  the  polymers  have  deformed.  The  simplest  single  time  constant  model  that 
will  capture  this  deformational  behavior  is  that  of  two  massless  dumbbell  ends  connected  by  a  spring. 
Spring  deformation  is  determined  by  the  balamce  between  the  spring  force  and  the  surface  drag  force 
on  the  two  dumbbell  ends.  The  rotational  modeling  remains  that  o'^  a  rigid  ellipsoid.  No  attempt 
is  made  to  capture  the  internal  flow  within  the  polymer  envelope.  Rather,  the  model  is  designed 
to  model  the  motions  of  the  polymer  molecule  both  at  low  strain  rates  and  at  strain  rates  which 
are  high  enough  to  cause  some  deformation.  We  employ  the  Keyes-Abernathy  polymer  model  here, 
generalizing  it  to  3-D  flows  with  streamwise  velocity  gradients.  In  the  following,  all  quantities  have 
been  nondimensionalized  with  respect  to  the  usual  characteristic  values,  unless  stated  otherwise. 

Consider  a  polymer  molecule  embedded  in  a  local  flow  field  u  =  <u,  v,  w>.  We  begin  by  defining 
a  local  Cartesian  coordinate  system  within  the  frsune  of  the  polymer.  Let  the  origin  be  the  instanta¬ 
neous  location  of  the  center  of  the  polymer  at  the  beginning  of  the  computational  time  step.  Choose 
one  axis  in  the  flow  direction,  u.  Choose  another  axis  s  to  be  the  component  of  the  shear  vector 
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perpendicular  to  u:  8  =  V|u|  —  u  •  V|ii|  u.  The  third  axis  is  then  /  =  u  x  s.  Normalize  each  to 
obtain  the  flow  axis  u,  the  shear  axis  i,  and  the  flipping  axis  /.  By  construction,  these  three  vectors 
are  orthogonal.  In  this  coordinate  system,  the  dimensionless  local  shear  k  =  V|u]  ■  i  will  cause 
the  polymer  to  flip  about  the  /  axis.  Local  flow  velocities  work  against  the  stresses  internal  to  the 
polymer  to  translate  the  polymer  ends,  usually  at  different  rates.  Deformations  of  the  polymer  due 
to  shear  stress  and  strain  occur  in  the  direction  of  the  ellipsoid’s  semi-major  axis;  we  neglect  bending 
of  the  polymer  molecule.  We  assume  the  two  effects  of  rotation  and  deformation  to  be  separable;  we 
calculate  them  independently  at  each  step,  and  then  superpose  them  to  And  the  resultant  motion  of 
the  molecule. 

We  first  consider  the  flipping  motion  of  the  polymer.  Toward  this  purpose,  we  envision  the 
polymer  as  a  rigid  prolate  ellipsoid  whose  semi-major  and  semi-minor  axes  have  lengths  A  and  B 
respectively,  where  A  and  B  are  both  measured  in  units  of  H.  Let  Ci  be  the  dimensional  position 
vector  from  the  polymer  center  to  a  point  on  the  semi-major  axis  which  is  a  distance  BH  short 
of  the  i’th  tip  of  the  polymer,  where  i  =  1  or  2  for  the  two  ends  of  the  polymer.  (This  point 
serves  as  the  bead  center  and  spring  end  in  the  bead-and-spring  dumbbell  formulation  below.)  We 
nondimensionalize  the  polymer’s  physical  dimensions  with  respect  to  the  dimensional  length  of  the 
semi- minor  axis,  BH.  The  dimensionless  position  vector  is  •  Let  ^  denote  the  ^ 

with  positive  s  component.  Note  that  ^  is  parallel  to  the  polymer’s  semi-major  axis.  A  convenient 
dimensionless  measure  of  polymer  shape  is  $  =  |^  =  ^  —  1.  Denote  the  projection  of  ^  into  the  u-s 
plane  by  ^p,  with  magnitude  ^p  =  |4p|.  Let  <t>  denote  the  angle  in  the  u-s  plane  from  the  shear  axis 
i  to  4p.  The  local  shear  «  causes  the  polymer  to  flip  about  the  /  axis  in  the  locad  u,s,/  coordinate 
system  at  the  polymer  center.  Jeffery  (1922)  has  calculated  the  angular  velocity  of  this  flipping 
motion  by  setting  the  net  torque  on  the  ellipsoid  equal  to  zero,  thereby  obtaining: 

_  {^p  +  1)*  cos*  (f>  +  sin*  <l>  ^ 


Note  from  this  equation  that  a  sphere  (^p  =  0)  rotates  at  constant  angular  velocity  as  it  is  convected 
along.  As  the  aspect  ratio  increases,  the  tmgul2ir  velocity  is  slowest  when  the  semi-major  axis  is 
aligned  with  the  flow  (^  =  ^)  and  is  fastest  when  the  ellipsoid  is  broeidside  to  the  flow  dixectknt 
{<f>  =  0),  that  is,  when  the  semi-minor  axis  is  aligned  with  the  shear  axi.^  i.  Greater  elongation 
(higher  ^p)  in  a  given  shear  decreases  the  flipping  frequency,  which  is  found  by  integration  of  the 

angular  velocity  equation  1  to  be  •  At  a  given  elongation  ^p ,  higher  shear  also  causes 

'(sp  +2?p+2) 

the  polymer  to  rotate  f:  ster.  To  calculate  the  flipping  motion  of  the  polymer  at  each  time  step,  we 
first  project  the  polymer  into  the  u-s  plane,  calculate  4>  and  ^p  from  the  projection,  smd  then  use 
the  angular  velocity  equation  1  to  calculate  the  rotation  about  the  /  axis. 

We  now  consider  polymer  deformations.  During  e2M:h  computational  time  step,  we  assume  the 
following: 


1 .  The  external  streamwise  strain  rate  is  constant  over  the  length  of  the  polymer  during  the 
step. 

2.  The  external  flow  velocity  is  parallel  to  the  flow  axis  u  over  the  length  of  the  polymer  during 
the  step.  Note  that  this  assumption  is  necessarily  true  at  the  beginning  of  each  step  at  the 
polymer  center  by  construction  of  u,  and  will  remain  approximately  true  for  sufficiently  small 
steps. 

3.  The  polymer  retains  an  equivalent  elliptical  shape,  with  no  bending. 
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Forces  acting  on  the  polymer  are  the  Stokes  drag  acting  over  the  body  of  the  polymer  and  the  strain 
internal  to  the  polymer.  For  simplicity,  we  now  use  a  bead-and-spring  dumbbell  model  of  the  polymer 
to  calculate  these  forces.  We  model  the  polymer  by  two  equivalent  spheres  of  dimension^  radius 
BH  joined  by  a  spring  of  dimensional  equilibrium  length  which  has  no  direct  interaction  with 

the  flow.  We  adopt  a  variant  of  the  Fraenkel- Warner  spring  (Bird,  1987)  with  dimensional  restoring 
force  f  =  where  ^  is  a  dimensional  spring  constant,  and; 


G(0 


1  1 


if  ^ 

if  ^  >  iwir 


1 


(2) 


Keyes  and  Abernathy  (1987)  have  previously  determined  most  of  the  parameter  values  for  the  above 
spring  function  by  fitting  the  model  to  experiments.  The  equilibrium  length  of  the  polymer  is  set 
to  =  2  in  order  to  match  the  observed  frequency  of  polymer-induced  fluctuations.  The  softening 
constant  is  set  to  p  =  4  to  allow  significant  stretching  to  begin  in  the  observed  regime.  The  Warner 
constant  is  set  to  =  100  as  representative  of  the  maximum  observed  (or  inferred)  polymer 
elongation.  We  add  the  last  term  in  the  spring  function  equation  2,  which  activates  at  =  0.9 
of  the  m2iximum  extension  with  hardening  constant  q  =  0.1,  to  model  the  eventual  tightening 
of  the  polymer  near  its  maximum  extension.  We  choose  this  form  so  as  to  eliminate  the  sharp 
jerk  which  Keyes  and  Abernathy’s  (1987)  polymer  underwent  when  the  polymer  reached  meiximum 
extension.  This  function  is  graphed  in  Figure  2.  While  it  possesses  the  qualitative  behavior  we 
seek  in  a  relatively  simple  form,  there  are  many  other  equally  good  alternatives.  To  account  for 
hydrodynamic  shielding  of  the  two  spheres  at  small  separations,  we  multiply  the  Stokes  drag  by  a 
factor  0  which  is  unity  at  infinite  separation  and  approaches  infinity  as  the  spheres  touch.  This 
prevents  interpenetration  of  the  spheres.  Brenner  (1961)  has  calculated  this  /?  for  Stokes  flow, 
yielding: 


.  _  4  .  ^  m(m  -t-  1)  4  cosh^  (m  -f  |)a  -|-  (2m  -b  1)^  sinh^  a 

3*^”  (2m  —  l)(2m -i- 3)  2sinh(2m -t- l)a  —  (2m -1- l)sinh2a 


where  a  =  cosh”^  This  function  is  plotted  in  Figure  3.  Setting  the  sum  of  the  Stokes  and  spring 
forces  to  zero  yields  a  differential  equation  for  the  position  of  each  of  the  spheres  £,■  in  terms  of 
dimensional  variables; 

dCi  _  AUi  gcio  ,  . 

dJ  67rp/?(0^’  ^  ^ 

where  Ai/j  is  the  dimensional  velocity  difference  between  the  ambient  fluid  at  the  I’th  sphere  and 
at  the  polymer  center.  Since  we  are  dealing  with  a  flow  that  has  nonzero  strain  rate  in  essentially 
only  the  streamwise  direction,  the  dimensionless  velocity  difference  is 

=  -^[“{0}  +  i^i  -  *o)«r]i  +  [Vfi}  -  +  [W{i]  -  W'{0}]z,  (5) 

where  the  subscripts  i  denote  ambient  quantities  at  the  i**  sphere,  and  where  the  subscripts  zero 
denote  quantities  at  the  polymer  center.  When  nondimensionalizing  the  position  vector  equation  4 
with  respect  to  the  usual  characteristic  variables,  we  define  C  =  ^  ^  dimensionless  measure 

of  the  spring  force  versus  the  average  viscous  drag  force  on  the  spheres.  This  dimensionless  spring 
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constant  C  is  related  to  the  an2dogou8  constant  used  in  Keyes  and  Abernathy  (1987)  by  C  =  CKtyt$  k, 
where  k  is  the  dimensionless  local  shear. 

Another  issue  to  be  addressed  is  whether  the  polymer  should  maintam  constant  volume  during 
deformations,  or  whether  it  should  maintain  constant  semi-minor  axis,  or  whether  it  should  possibly 
exhibit  some  intermediate  behavior.  This  will  be  am  important  issue  later  when  investigating  how 
much  deformation  polymers  undergo  when  embedded  in  a  vortical  flow.  In  the  next  section  we 
will  argue  that  the  actual  polymer  behavior  is  probably  approximately  constant  semi-minor  axis 
at  small  deformations  when  polymer  coils  are  still  relatively  tightly  wound,  and  is  approximately 
constant  volume  at  large  deformations  when  all  coils  have  been  effectively  opened  and  the  polymer 
has  saturated  with  entrained  fluid.  For  now,  we  will  simply  outline  the  calculation  of  the  two 
extreme  cases.  In  the  constant  semi-minor  axis  case,  we  just  take  the  difference  between  the  current 
positions  of  the  two  sphere  centers  to  find  the  separation  L,  then  calculate  the  current  aspect 
ratio  ^  In  the  constant  volume  case,  the  semi-minor  axis  length  B  is  variable.  We  then 

combine  the  equation  for  the  polymer  volume  V  =  -I- 1)5®  with  the  dimensionless  deformation 

^  ^  eliminate  B.  This  yields  a  cubic  equation  for  the  current  deformation 

^ .  Therefore  at  the  end  of  eaw:h  computational  step  we  use  the  known  constsmt  volume  V  amd  the 
new  elongation  L  to  solve  for  and  then  we  solve  for  the  new  semi-minor  axis  B  =  i.  Since  B  is 
variable,  the  Stokes  drag  over  the  sphere  decreases  as  its  radius  B  decreases  -  followed  to  its  logical 
conclusion,  we  would  therefore  decrease  the  dimensionless  drag  constant  '  as  polymer  extension  { 
increases.  Yet  we  expect  the  Stokes  drag  over  the  entire  surface  of  the  pc: ymer  to  remain  constant  or 
perhaps  even  increase  at  larger  polymer  extensions.  We  model  this  c,,  "“cted  behavior  by  retaining 
a  constant  dimensionless  drag  C  at  all  polymer  extensions.  This  drag  ccustant  C  is  taken  to  be  that 

of  the  polymer  at  equilibrium  =  ^e,).  Therefore,  C  =  where  V  is  the  given 

constant  volume. 

In  summary,  our  computation  for  the  motion  of  a  polymer  embedded  in  an  external  flow  is: 

•  Construct  local  coordinate  system: 

«  _  y  - 

.  _  V|u|-iiV|g|ti 

|V|ul-fkV|uItir 

/  =  fixs.  (8) 

•  Project  polymer  into  u-s  plane.  Determine  4>  ^d  from  the  projection. 

•  Calculate  rotation  about  /  axis: 

^  _  (Cp-H)^cos®^-hsin^<i 

dt  ($p  + 1)2  +  1 

•  Calculate  Li  =<  Xi,yi,  Zi  >  for  both  polymer  ends  (i  =  1, 2)  due  to  deformation: 

^  u{0}  +  (x,-  -  xo)u^  R  ^  G{OB  R 

dt  m  Ru  ^  m 

dt  m  ^ /?(o 

fifi  _  ^{i)  -  “'{0}  ^  GjQB  R 

dt  m  ^  m  ^ 
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•  Superpose  rotation  and  deformation  to  calculate  new  polymer  position. 

•  If  constant  volume:  calculate  and  then  B,  from: 


6V'  ,3 


jrZ.3 


= 


0, 

I 


(11) 

(12) 


•  If  constant  semi-minor  axis:  calculate  ^  ^ . 

We  embed  the  above  polymer  in  the  external  flow  of  a  streamwise  vortex  in  a  shear  flow  by  specifying 
initial  values  for  the  position  of  the  tw-o  spherical  ends  and  for  (  (which  in  turn  determines  the  initial 
semi-minor  axis),  and  then  running  the  flow  model.  After  each  computational  flow  step,  we  use  the 
local  flow  velocities  to  calculate  an  updated  shape,  position,  and  orientation  for  the  polymer. 


Figure  1 :  The  hybrid  polymer  model,  regarded  as  an  ellipsoid  for  calculating  rotation,  and  as  a  bead-and- 
spring  dumbbell  for  calculating  deformation.  The  first  view  shows  dimensional  lengths,  the  second  view 
shows  dimensionless  lengths,  and  the  third  view  shows  dimensionless  lengths  projected  into  the  ti-s  plane. 
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SHIELDING  FACTOR  f 


Figure  2;  The  dimensionless  spring  force  G  as  a  function  of  polymer  extension  as  given  by  Equation  2. 
Note  the  repulsive  force  when  compressed  from  equilibrium,  and  the  smooth  tightening  as  the  spring  ap¬ 
proaches  its  maximum  extension. 


f 


Figure  3:  The  Brenner  correction  factor  as  a  function  of  polymer  extension  as  given  by  Equation  3. 
Multiplying  the  Stokes  drag  by  this  factor  represents  shielding  of  the  radial  fluid  force  on  one  sphere  due  to 
the  presence  of  the  other,  and  prevents  interpenetration  of  the  two  spheres  at  small  separations. 


3  Benchmark:  Comparison  to  Previous  Results 

In  this  section,  we  validate  the  above  methodology  for  polymer  dynamics  by  comparison  with 
previous  results.  Since  our  model  is  a  generalization  of  the  model  of  Abernathy  et  al  (1980)  and 
Keyes  and  Abernathy  (1987),  we  first  replicate  their  results  with  our  model,  then  perturb  the  flow 
slightly  so  as  to  still  obtain  essentially  the  same  results,  but  by  using  some  of  the  other  logical 
branches  in  our  numerical  implementation. 

VVe  first  consider  a  polymer  molecule  with  the  original  spring  constant^  from  Abernathy  et  al 
(1980)  and  Keyes  and  Abernathy  (1987): 


G(0  = 


1  -(7^)^ 


1 

(l+^-Ce,)-'' 


if€  >6,  J  • 


(13) 


Fixed  parameters  of  the  polymer  are  a  maximum  extension  of  =  10,  softening  constant  p  =  0, 
and  shielding  function  /?($)  =  i.  We  model  the  polymer  as  having  constant  semi-minor  axis  B; 
in  other  words,  as  having  variable  volume.  We  embed  this  polymer  in  a  uniform  shear  flow  which 
can  be  described  dimensionally  as  U  =<lCy,  0, 0>.  This  is  the  situation  investigated  by  Abernathy 
et  al  (1980).  For  nondeformable  (C  — ►  00)  polymers  whose  equilibrium  shapes  are  ellipsoids  of 
various  extensions  ^4,,  we  generate  rotation  angles  <j»  versus  time  as  shown  in  Figure  4.  These  curves 
correspond  to  Figure  6  of  Abernathy  et  al  (1980).  Note  that  the  flipping  frequency  decreases  with 
increasing  equilibrium  extension  •  For  polymers  of  various  equilibrium  shapes  f*?  and  of  v2urious 
stiffnesses  C,  we  generate  the  flipping  periods  shown  in  Table  1.  These  correspond  to  selected  points 
from  Figure  8  of  Abernathy  et  al  (1980).  In  additional  trials,  we  note  the  relative  insensitivity 
of  polymer  extension  to  the  maximum  allowable  extension  ,  as  long  as  the  adlowable  maximum 
remains  large  compared  to  the  actual  maximum  deformation  which  the  polymer  undergoes.  For 
polymers  of  equilibrium  shape  =  2,  maximum  extension  fiv  =  10,  and  of  various  stiffnesses 
C,  we  generate  deformations  4  versus  rotation  angle  <f>  as  shown  in  Figure  5.  These  correspond  to 
Figure  9  of  Abernathy  et  al  (1980).  In  all  cases,  agreement  is  excellent. 

Next,  we  again  consider  a  polymer  molecule  with  constant  semi-minor  axis  and  with  spring 
constant  given  by  Equation  13.  Now  we  fix  the  polymer’s  parameters  at  an  equilibrium  extension 
of  (gj  =  2,  maximum  extension  =  100,  softening  constant  p  =  4,  and  shielding  function  as 
given  by  Equation  3.  For  polymers  of  various  stiffnesses  C,  we  find  the  limit  cycles  of  deformation 
versus  rotation  angle  as  shown  in  Figure  6.  This  figure  corresponds  to  Figure  8  of  Keyes  and 
Abernathy  (1987).  Again,  agreement  is  good.  Early  discrepancies  in  the  analysis  of  this  case 
revealed  a  typographical  error  in  Equation  10b  of  Keyes  and  Abernathy  (1987).^ 

In  order  to  test  other  branches  of  our  logic,  we  again  consider  a  polymer  with  the  same  fixed 
parameters  as  above,  and  with  stiffness  C  =  5,  in  flows  slightly  perturbed  from  that  used  above. 
The  flows  we  test  are  listed  in  Table  2.  In  all  cases,  the  results  are  practically  the  same  as  obtained 
above.  This  provides  some  degree  of  assurance  that  our  various  logical  branches,  used  for  dealing 
with  different  flow  situations,  are  working  as  intended. 

Finally,  we  test  our  logic  for  dealing  with  the  stream  wise  strain  rate  component  of  a  flow  by 
embedding  a  nonresisting  (C  =  0)  polymer  in  a  pure  streamwise  rate  of  strain  flow  u  =<0.01(a:  -F 
0.005),  0,0>.  Although  compressible,  this  flow  is  a  simple  test  of  our  handling  of  the  streamwise 

*  This  original  spring  constant  imparts  a  very  sharp  change  in  tension  to  the  polymer  as  it  nears  its  maximum 
extension  (  =  (nr,  much  like  a  rope  that  has  been  snapped  to  its  greatest  extension.  This  was  understandably  of 
little  concern  to  Keyes  and  Abernathy,  given  their  modest  polymer  extensions  in  pure  shear  flow.  Since  we  shall  soon 
see  that  our  flow  will  deform  polymers  to  a  much  greater  degree,  we  have  replaced  this  original  form  with  the  more 
smoothly  tightening  form  of  Eiquation  2. 

*The  corrected  equation  reads:  i  =  (  sin0co8  0//3(4)  —  Cg(()/0((). 
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Spring  Stiffnenn 

(O 

Equilibrium  Extension 

Maximum  Extension 

((mss) 

Period 
(to  flip  180*) 

0.01 

0 

0.00001 

6.2819 

0.01 

1 

1.87 

9.7366 

0.01 

2 

4.07 

14.9224 

0.1 

2 

3.97 

14.7952 

1 

2 

2.89 

11.1434 

10 

2 

2.10 

10.4803 

100 

0 

0.000005 

6.2819 

100 

1 

1.005 

7.8541 

100 

2 

2.01 

10.4721 

Table  1:  Maudmom  extennons  and  flipping  periods  for  polymers  of  various  equilibrium  shapes  and  stilfnesses 
in  uniform  shear  flow.  Other  parameters  are  the  same  as  for  the  pdymers  of  Figure  4.  Replication  of  selected 
points  from  Figure  8,  Abernathy  et  al  (1980). 


Flow  Type 

Vortex  Strength 

Ry 

Local  Velocity 
u 

Streamwise  Strain  Rate 

X  independent 

0 

0.5 

0 

X  independent 

0.1 

0.5 

0 

t  independent 

0.1 

0.5 

0  (held  constant) 

t  independent 

0.1 

0.5 

10”^  (held  constant) 

i  independent 

0 

0.75 

0 

Table  2:  Slightly  perturbed  flow  conditions  for  the  polymer  (C  =  5)  of  Figure  6.  We  duplicate  that  figure 
in  each  of  these  flows,  testing  other  branches  of  our  logic  for  different  flow  situations.  Local  shear  is  ^  =  1 
in  ail  cases. 
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oraiii  rate  component  of  a  3-D  flow.  Results  are  in  exact  agreement  with  the  analytic  solution  -_the 
f^s^h  oHhe  Xter  grows  exponentially  in  time,  and  linearly  with  the  streamwise  position  of  the 

XhhoS  Keves  experimented  with  constant  volume  polymers,  he  does  not  report  on  change 
in  limit  cycle  obtained  when  changing  from  constant  semi-minor  axis  to  constant  %  olume.  For  spri  g 
constant  C  =  5  in  uniform  shear  flow,  we  observe  the  maximum  polymer 

from  £  =  4  2  to  Uar  =  181-  This  seems  an  unreasonably  large  extension  in  pure  shear  flovv^ 

We  sus7ect  that  the  actual  polymer  behavior  is  approximately  constant  semi-minor 
deformations  when  polymer  coils  are  still  relatively  tightly  wound,  and 

volume  at  large  deformations  when  all  coils  have  been  effectively  opened  and  the  polymer  h^ 
saturated  with  entrained  fluid.  It  therefore  seems  reasonable  to  treat  polymers  which  undergo  on 
modest  deformations  (Uar  <  10)  under  the  constant  semi-minor  axis  assumption  as  bemg  well- 
described  by  that  assumption,  and  to  treat  polymers  which  would  undergo  sipificant  stretching 
(&.ar  >  10)  under  the  same  assumption  as  actually  being  better  modeled  by  the  constant  volume 
assumption.  We  implement  this  idea  in  our  polymer  simulations  of  the  following  section. 

Now  that  we  have  tested  and  validated  the  polymer  model,  we  are  ready  to  check  its  predictions 

in  a  more  interesting  flow. 
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Figure  4:  Rotation  angle  c)  versus  time  for  nondeformable  (C  — -  oo)  polymers  of  equilibrium  extension 
=  1,  2,  4,  8  in  uniform  shear  flow.  Equation  13  describes  the  spring  force,  shielding  function  is  |9(£)  =  1, 
maximum  extension  is  £;v  =  10,  and  softening  constant  is  p  =  0.  Replication  of  Figure  6,  Abernathy  et  al 
(1980). 
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Figure  5:  Polj’mer  deformation  {  versus  rotation  angle  p  for  polymers  of  stiffness  C  =  0.01,  0.1,  1,  10,  100 
in  uniform  shear  flow.  Equilibrium  shape  is  Cm  =  2.  and  other  parameters  are  the  same  as  for  the  polymers 
of  Figure  4.  Replication  of  Figure  9,  Abernathy  et  al  (1980). 
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Figure  6:  Limit  c.vcie  of  polymer  deformation  (  versus  rotation  angle  ^  for  polymers  of  stiffness  C  =  0.5, 
3.  30.  Equation  13  describes  the  spring  force,  shielding  function  /9(()  is  given  by  Equation  3,  equilibrium 
shape  is  =  2,  maximum  extension  is  (iv  =  100,  and  softening  constant  is  p  =  4.  Replication  of  Figure  8, 
Keyes  and  Abernathy  (1987). 
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4  Dynamics  of  the  Embedded  Polymer 

We  now  investigate  the  dynamics  of  the  polymer  of  Section  2  embedded  in  a  streamwise  indepen¬ 
dent  vortical  flow  of  a  transient,  streamwise  independent  flow  and  in  a  streamwise  dependent  flow, 
both  of  which  are  comprised  of  a  diffusing  streamwise  vortex  aligned  in  the  flow  direction  of  a  shear 
flow.  Our  aim  here  is  to  determine  whether  such  flow  fields  are  capable  of  deforming  embedded 
polymers  significantly  more  than  uniform  shear  flows. 

We  first  show  that  significant  polymer  deformations  do  not  occur  in  vortical  flows  which  lack 
streamwise  velocity  gradients.  We  consider  both  streamwise  independent  vortical  flows  and  stream- 
wise  dependent  flows  in  which,  within  the  numerical  code,  the  local  streamwise  strain  rate  in  the 
vicinity  of  the  polymer  has  been  artificially  set  to  zero.  In  both  of  these  flows,  a  polymer  of  variable 
volume  (constant  semi-minor  «ixis)  flips  and  deforms  in  the  same  manner  found  by  Keyes  (and  as  ver¬ 
ified  in  our  benchmarks.  Section  3)  in  uniform  shear,  as  determined  by  the  local  shear  m  the  vicinity 
of  the  polymer.  This  occurs  regardless  of  initial  location  and  initial  orientation  of  the  polymer,  and 
regardless  of  vortex  strength.  This  is  due  to  the  large  difference  between  the  characteristic  length 
scales  of  the  polymer  and  the  vortex;  on  the  length  scale  of  the  polymer,  the  local  flow  field  due  to 
the  vortex  is  indistinguishable  from  a  uniform  shear  flow.  The  only  effect  of  the  vortex  is  to  rotate 
the  local  coordinate  system  within  which  the  polymer  executes  its  flipping  motion.  Peterlin  (1970) 
anticipated  this  failure  of  the  vortex  crossflow  to  act  as  anything  other  than  a  new  shear  on  the 
polymer  s  length  scale,  and  tried  to  resolve  the  problem  by  proposing  the  polymers  to  be  activated 
by  the  crossflow  shear  of  “microvortices’' ,  whose  crossflow  length  scale  is  comparable  to  that  of  the 
polymer.  These  microvortices  would  have  to  be  nearly  potential,  and  the  polymer  would  have  to  be 
very  close  to  the  vortex  center,  for  significant  polymer  deformation  to  occur.  However,  the  number 
density  of  both  polymers  and  of  microvortices  of  the  requisite  size,  strength,  and  concentration  are 
low  enough  to  make  the  required  vortex-polymer  pairing  too  rare  an  event  to  enable  this  to  be  a 
viable  mechanism.  We  therefore  conclude  that  significant  deformations  of  a  constant  semi-minor 
axis  polymer  do  not  occur  in  vortical  flows  which  lack  streamwise  velocity  gradients,  regardless  of 
initial  polymer  location  and  orientation  and  regardless  of  the  vortex  strength. 

We  now  consider  a  streamwise  dependent  vortical  flow  with  streamwise  velocity  gradients.  Since 
we  find  that  a  polymer  embedded  ’n  such  a  flow  experiences  large  deformations  under  the  constant 
semi-minor  axis  assumption,  we  conclude  that  the  polymer  is  probably  better  described  as  being  of 
constant  \olume,  as  discussed  at  the  end  of  the  previous  section.  Figure  7  shows  the  deformation  of 
a  constant  volume  polymer  near  an  initially-potential  vortex  of  strength  =  6.5  as  the  polymer 
IS  convected  in  the  streamwise  direction.  Note  that  during  the  few  flips  before  the  streamwise 
gradient  induces  supercritical  stretching  (at  streamwise  positions  x  <  12),  the  maximum  polymer 
deformation  is  about  a  factor  of  fc  ..r  smaller  than  that  expected  in  the  same  flow  without  a  vortex. 
This  is  because  the  edge  of  the  vortex  core  quickly  diffuses  past  the  polymer,  thereby  flattening 
the  local  shear  near  the  polymer.  At  streamwise  position  x  ss  12,  we  see  that  the  streamwise 
velocity  gradient  has  grabbed  the  polymer  and  induced  supercritical  stretching.  Figure  8  shows  the 
deformation  of  a  constant  volume  polymer  near  the  core  edge  of  a  fully-diffused  vortex  (meaning  near 
the  flow  boundary)  of  strength  ~  6.5.  Again,  we  see  that  the  streamwise  velocity  gradient  grabs 
the  polymer  during  each  flip  cycle  and  induces  significant  stretching,  with  maximum  deformation 
similar  to  that  found  near  an  initially-potential  vortex.  Experimentation  shows  that  the  threshold 
vortex  strength  required  to  induce  these  large  extensions  is  ta  3  near  a  potential  vortex,  ot  Rj,  sv  5 
near  the  edge  of  a  diffuse  vortex.  The  streamwise  velocity  gradient  grabs  the  polymer  when  it  is  at 
favorable  orientations;  the  onset  of  large  deformations  can  be  relatively  sensitive  to  this  initial 
orientation,  but  polymers  whose  semi-major  axis  i  is  initially  within  about  25*  of  the  ti  -  i  plane 
achieve  the  required  orientation  within  about  one  to  three  flips  (Figure  9).  Polymers  that  begin  at 
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orientations  outside  of  this  range  perform  “log  rolls”  about  the  flipping  axis  /,  and  do  not  experience 
significant  stretching  (Figure  10).  So  for  a  dilute  concentration  of  polymers  with  initially  rwdom 
orientation  located  near  such  vortices,  we  can  expect  about  28%  to  be  activated  by  the  vortex.  This 
yields  an  £7(1)  correction  to  the  effective  polymer  concentration. 

A  positive  stresunwise  velocity  gradient  is  necessary  to  overcome  the  initial  spring  resistance. 
Theresifter,  the  streamwise  gradient  is  relatively  unimportant  to  further  stretching,  and  a  positive 
streamwise  velocity  difference  between  the  two  polymer  ends  is  sufficient  for  continued  stretching. 
This  is  demonstrated  by  the  fact  that  once  significant  stretching  has  begun,  a  positive  streamwise 
velocity  difference  can  continue  the  stretching,  even  if  the  polymer  passes  through  a  region  of  negative 
streamwise  velocity  gradient.  Polymers  usually  pass  several  times  from  regions  with  one  sign  of 
streamwise  gradient  to  regions  with  the  other  sign  of  streamwise  gradient.  This  is  because  there 
are  either  two  or  four  such  regions  of  alternating  sign  in  the  neighborhood  of  the  vortex,  and  since 
polymers  remain  on  roughly  a  cylindrical  surface  as  they  are  convected  around  by  the  crossflow  and 
downstream  by  the  mean  flow,  therefore  the  crossflow  naturally  converts  polymers  around  between 
the  regions  of  streamwise  velocity  gradient  with  alternating  sign. 

Since  the  rate  of  spring  softening  at  larger  extensions  (reflected  in  the  softening  constant  p  of 
the  polymer  model)  was  not  as  sharply  determined  as  the  other  model  constants  in  the  original 
formulation  by  Keyes  and  Abernathy,  it  is  appropriate  to  determine  the  sensitivity  of  our  observed 
supercritical  polymer  stretching  behavior  to  changes  in  spring  softening.  We  find  that  qualitatively 
similar  stretching  occurs  for  springs  with  softening  constant  p  >  2.7  (Figure  11).  This  threshold 
corresponds  to  a  maximum  spring  cons*  in'  of  Gmas  =  0.17,  occurring  at  extension  ^  =  2.6.  Larger 
softening  constants  (such  as  Keyes  and  Abernathy’s  best  fit  of  p  =:  4,  which  yields  a  maximum 
spring  constant  of  Gmax  =  0.10  at  ^  -  2.3)  allow  the  positive  streamwise  velocity  gradient  to  stretch 
the  polymer  more  quickly,  but  to  approximately  the  same  extensions.  Smaller  softening  constants 
p  <  2.7  correspond  to  greater  maximum  spring  constants  Gmax  >  0.17,  which  are  strong  enough  to 
inhibit  large  stretching  of  the  polymer  at  the  streamwise  strain  rates  typically  found  in  our  vortical 
flows. 

Figures  7  and  8  illustrate  how  large  deformations  (on  the  order  of  ^  50  to  60)  can  occur  for 

vortices  with  different  initial  distributions  of  vorticity.  This  is  one  order  of  magnitude  larger  than 
the  deformation  obtained  in  laminar  shear  flow.  Since  the  streamwise  strain  rate  is  the  physical 
mechanism  responsible  for  initiating  supercritical  stretching  of  the  polymers,  we  infer  that  large 
polymer  deformations  begin  most  often  at  the  edge  of  the  vortex  core,  where  the  streamwise  strain 
rate  is  a  maximum.  This  statement  holds  on  average,  but  not  ^dway£  in  individual  cases,  since 
the  onset  of  large  polymer  deformations  is  dependent  to  some  degree  on  the  initial  orientation  of 
the  polymer. 

Figure  12  shows  how  polymers  at  several  different  initial  locations  in  the  flow  can  all  be  subject 
to  a  large  amount  of  deformation.  Even  a  polymer  that  begins  on  the  side  of  the  vortex  with  negative 
streamwise  strain  rate  is  greatly  elongated  as  it  is  convected  around  the  vortex  and  into  a  region 
with  positive  strain  rate. 

Figure  13  shows  how  the  maximum  deformation  of  a  polymer  varies  as  a  function  of  the  vortex 
Reynolds  number  Rv  ■  At  each  Rv ,  the  maximum  deformation  shown  is  taken  from  over  a  variety  of 
initial  locations  and  orientations  of  the  polymer.  It  appears  that  stronger  vortices  generally  deform 
the  polymers  to  a  greater  degree.  Note  that  the  polymer  is  inextensible  past  the  Warner  limit 

=  100. 

Constants  in  our  adopted  polymer  model  were  fit  by  Keyes  and  Abernathy  in  accordance  with 
data  that  for  the  most  part  involved  polymers  at  small  deformations.  A  degree  of  uncertainty  in  these 
parameters  enters  when  we  begin  to  deal  with  highly-deformed  polymers.  In  fact,  small  changes  in 
the  polymer  model  at  large  deformations  can  significeuitly  change  the  maximum  polymer  extension. 
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For  example,  Figure  14  shows  the  deformation  of  the  polymers  of  Figures  7  and  8  under  the  change 
that  the  flipping  rate  ^  is  reduced  by  half  after  a  polymer  extension  of  (  >  10.  In  both  cases, 
the  polymer  deforms  to  its  maximum  extensional  limit.  Thus,  at  these  large  polymer  deformations, 
there  is  a  degree  of  model  uncertainty.  Indeed,  as  large  as  our  current  polymer  deformations  are, 
they  may  be  even  larger  with,  for  example,  decreased  flipping  rate,  increased  spring  softening,  or 
increased  Stokes  drag  with  polymer  extension. 

The  large  polymer  extensions  demonstrated  above  raise  the  issue  of  polymer  entanglement.  Our 
non-bending  polymer  assumption  seems  quite  reasonable  for  polymers  undergoing  stretching,  but 
seems  improbable  during  contraction,  especially  for  highly  elongated  polymers.  Memory  and  bend¬ 
ing  of  highly  elongated  polymers  during  contraction  make  it  seem  likely  that  polymers  take  longer 
to  collapse  than  to  stretch,  and  that  the  polymer  would  undergo  significant  bending  2uid  buckling 
during  this  time.  The  flipping  of  these  bent  polymers  suggests  the  possibility  of  polymer  entangle¬ 
ments  above  some  threshold  concentration.  In  order  to  make  a  crude  estimate  as  to  this  required 


concentration,  assume  a  cubic  lattice  of  polymers  aligned  in  the  flow  direction  with  uniform  sepa¬ 
ration  S.  The  distances  between  polymer  centers  in  the  two  crossflow  directions  are  both  2B  -1-  5, 
and  the  distance  between  polymer  centers  in  the  flow  direction  is  2(^e,  -b  1)B  -I-  5.  Therefore  the 
number  density  of  polymers  is  -njB-t-s]  volume  concentration  of  polymers  is 


equilibrium  extension  of  =  2  and  assuming  that  the  polymer  bends 


approximately  in  half  as  it  flips  (S  ~  ia^£.  gg  30B),  we  anticipate  significant  polymer  entangle¬ 
ments  at  volume  concentrations  greater  than  ~  8  x  10“®.  This  agrees  well  with  observed  volume 
concentrations  of  polymer  additives  required  for  the  onset  of  turbulence  suppression  and  turbulent 
drag  reduction.  This  equispaced  cubic  arrangement  is  a  rather  crude  assumption;  a  more  closely 
packed  arrangement  (such  as  hexagonal  or  face-centered  cubic)  seems  more  likely.  Bertschy  (1979) 
discusses  how  these  other  arrangements  would  change  the  above  threshold  volume  concentration  by 
a  factor  of  at  most  an  increase  of  only  13%.  It  should  be  noted  that  with  the  onset  of  polymer 
entanglements  at  large  extensions,  the  validity  of  our  polymer  model  in  this  regime  is  again  suspect. 

In  this  section,  we  have  presented  evidence  of  a  heretofore  unnoticed  mechanism  for  the  activation 
of  polymers  in  a  turbulent  flow.  This  mechanism  is  the  generation  in  vortical  flows  of  streamwise 
strain  rates  of  sufficient  strength  to  stretch  nearby  polymers  to  large  elongations.  With  the  resump¬ 
tion  of  flipping  by  these  highly  deformed  polymers,  we  have  also  demonstrated  a  viable  mechanism 
for  intermolecular  entanglements  in  dilute  polymer  solutions.  In  the  next  section,  we  infer  how  these 
elongated  polymers  affect  the  evolving  flow  field. 
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Figure  7:  Extension  of  a  polymer  located  near  an  initially-potential  streamwise  vortex  with  streamwise 
velocity  gradients.  The  streamwise  gradient  soon  overcomes  the  maximum  spring  force  and  begins  super¬ 
critical  stretching  of  the  polvmer.  Relaxation  of  the  polymer  is  eventually  caused  by  rotation  of  the  polymer 
axis  (  past  the  flow  axis  u.  C  =  5,  a  =  2.  R.,  =  6.5,  R  =  3000,  polymer  is  initially  located  at  r  _  -0.1, 
y  =  0.5,  1  =  0  and  oriented  with  its  semi-major  axis  parallel  to  the  i  axis. 
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Figure  8:  Extension  of  a  polymer  located  near  the  core  edge  (flow  boundary)  of  a  fully  diffused  stream- 
wise  vortex  with  streamwise  gradients.  The  streamwise  velocity  gradient  again  induces  significant  polymer 
stretching,  with  ma.xiniuin  deformations  similar  to  those  fouud  near  an  initially-potential  vortex.  C  =  5, 
a  =  2,  R^  =  6.5.  R  =  3000,  polymer  is  initially  located  at  j  =  -0.9,  y  =  0.5,  r  =  0,  and  is  initially  oriented 
with  its  semi-major  axis  parallel  to  the  y  axis. 
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Figure  9:  Maximum  deformation  of  the  polymer  of  Figure  7  when  started  with  semi-major  axis  in  the  i  —  i 
plane  and  at  various  angles  from  the  i  axis.  Large  deformations  occur  when  the  polymer  is  within  about 
25®  of  the  i  axis. 
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Figure  10;  Extension  of  a  polymer  loc*  -*<i  near  an  initiaUy-potential  streamwise  vortex  with  streamwise 
velocity  gradients.  The  polymer  is  iniually  located  in  the  f  —  i  plane  with  a  35*  angle  between  its  semi-major 
axis  and  i.  The  polymer's  semi-major  axis  becomes  generally  aligned  with  the  flipping  axis  /  and  performs 
“log  rolls".  C  =  5,  o  =  2,  vRv  =  6.5,  R  =  3000,  polymer  is  initially  located  at  r  =  —0.1.  y  =  0.5,  z  =  0. 
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Figure  11:  Maximum  deformation  of  the  polymer  of  Figure  7  for  various  values  of  the  spring  softening 
constant  p.  Large  deformations  occur  for  values  greater  than  about  2.7. 
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Figure  12:  Maxitnum  deformation  of  the  polymer  of  Figure  7  for  various  initial  spanwise  locations.  Large 
deformations  can  occur  for  a  wide  range  of  initial  spanwise  locations.  Even  polymers  that  begin  on  the 
side  of  the  vortex  with  negative  streamwise  strain  rate  are  expanded  as  they  are  convected  into  the  positive 
strain  rate  region. 


Figure  13;  Maximum  deformation  of  a  polymer  near  vortices  of  various  vortex  Reynolds  numbers.  Stronger 
vortices  seem  to  induce  larger  maximum  deformations,  until  the  Warner  limit  {^w  =  100)  is  reached.  C  =  5, 
o  —  2.  R  =  3000.  The  maximum  deformation  is  taken  over  all  polymers  whose  initial  orientations  are  as  in 
Figure  9  and  whose  initial  locations  are  as  in  Figure  12. 
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Figure  14:  Deformation  of  the  polymers  of  Figures  7  and  8  under  the  change  that  the  flipping  rate  is  halved 
after  the  polymer  is  significantly  stretched.  Deformation  ceases  after  the  polymer  is  almost  fuUy  extended. 
This  demonstrates  how  small  changes  in  the  polymer  model  can  cause  significant  changes  in  the  dynamics 
of  highly  deformed  polymers. 
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5  Implications  for  Flow  Stability 

The  development  of  highly  elongated  polymers  in  a  streamwise  dependent  vortical  flow  implies 
an  increase  in  the  local  viscosity  due  to  increased  energy  dissipation  in  the  form  of  a  secondary  flow 
around  the  polymers.  We  now  address  several  related  questions  in  a  relatively  nonrigorous  way. 

Where  does  an  increase  in  the  flow  viscosity  occur?  The  development  of  elongated  polymers 
begins  in  regions  with  positive  streamwise  strain  rate,  but  then  stretching  can  continue  into  other 
flow  regions  with  positive  streamwise  velocity  difference  between  the  polymer  ends,  even  if  the 
streamwise  strain  rate  is  negative.^  Therefore  polymer  extension  is  very  memory-dependent,  and 
cannot  be  readily  predicted  from  local  flow  quantities.  We  do  not  do  so  here,  but  since  polymers 
maintain  a  relatively  constant  distance  from  the  vortex  center  and  since  stretched  polymers  persist 
over  some  time,  it  may  be  reasonable  to  predict  a  mriximum  polymer  extension  at  all  points  in  the 
flow  from  some  local  flow  quantity,  then  take  the  maximum  extension  over  all  angles  at  a  given 
radius  from  the  vortex  center  and  attribute  that  extension  to  all  points  at  the  given  radius.  Polymer 
stretching  is  more  easily  initiated  when  the  streamwise  strain  rate  is  high,  so  it  appears  that 
supercritical  stretching  of  the  polymers  begins  most  readily  at  the  edge  of  the  vortex  core,  where 
the  streamwise  strain  rate  is  highest.  Therefore  it  seems  likely  that  the  local  increase  in  viscosity 
occurs  mainly  at  the  edge  of  and  within  the  vortex  core,  as  the  moving  core  edge  will  have  swept 
over  and  activated  polymers  in  that  region  as  it  diffuses  outward. 

What  is  the  local  increase  in  flow  viscosity  due  to  the  extended  polymer?  Rheological  investiga¬ 
tions  on  this  general  topic  seem  to  fall  into  two  classes:  those  in  pure  shear  flow,  and  those  in  pure 
elongational  flow.  In  pure  shear  flow,  Einstein  (1906)  and  Jeffery  (1922)  investigated  the  secondary 
flow  around  a  dilute  solution  of  rigid  spheres  and  rigid  ellipsoids,  respectively,  to  determine  the  local 
increase  in  flow  viscosity.  Eirich  (1956)  discusses  the  extension  of  these  results  to  a  concentrated 
solution  of  rigid  spheres  as  a  function  of  their  volume  fraction.  Keyes  (1987)  approximates  the 
viscosity  increase  due  to  a  dilute  suspension  of  deforming  ellipsoids  as  the  sum  of  rotational  and  de- 
formational  energy  dissipation  components.  Graessley  (1965)  investigates  a  deforming  entanglement 
network  and  demonstrates  shear-thinning,  but  makes  no  connection  between  solvent  viscosity  and 
the  zero-shear  viscosity  of  his  entangled  network  solution.  In  pure  elongational  flow,  Peterlin  (1966) 
has  studied  dilute  dumbbell  and  necklace  models  with  their  more  numerous  degrees  of  freedom, 
and  King  and  James  (1983)  have  investigated  similar  models  which  are  “frozen”  after  partial  elon¬ 
gation  due  to  intramolecular  entanglements.  Takserman-Krozer  and  Ziabicki  (1963)  have  studied 
the  energy  dissipation  of  dilute  rigid  ellipsoids.  Leal  and  Hinch  (1973)  have  been  concerned  with 
dilute  particles  under  the  influence  of  Brownian  couples  in  both  shearing  and  strain  rate  flows.  In 
a  much-quoted  work,  Batchelor  (1971)  investigated  the  additional  stress  generated  by  both  a  dilute 
solution  and  by  a  hydrodynamically  concentrated  solution^  of  rigid  rods.  Ryskin  (1987)  incorporated 
Batchelor’s  results  into  his  view  of  an  uncoiling  (“yo-yo”)  polymer  which  always  has  a  taut  central 
section  of  some  effective  length.  This  notion  appears  useful  in  that  it  explains  the  experimental 
observation  that  stressed  polymers  nearly  always  break  in  the  middle.  Also,  Batchelor’s  treatment 
of  a  hydrodynamically  concentrated  rigid  rod  solution  seems  in  rough  agreement  with  the  more 
recent  tube  model  of  De  Gennes  (1971)  and  of  Doi  and  Edwards  (1986),  which  views  hydrodynam¬ 
ically  (  oncentrated  polymers  eis  each  confined  to  a  tube  with  streamwise  orientation.  The  enclosed 
polymer  moves  (reptates)  along  the  length  of  the  tube  on  a  long  time  scale  and  and  moves  along 

^  Recall  from  Section  4.3  that  the  vortex  creates  a  region  of  positive  streamwise  strain  rate  on  one  side  and  a  region 
of  negative  streamwise  strain  rate  on  the  other  side.  Therefore  the  vortex  crossflow  naturtdly  convects  an  embedded 
polymer  from  one  region  to  the  other. 

*  A  hydrodynamictdly  concentrated  solution  is  one  in  which  there  are  significant  dynamical  interactions  between 
adjacent  particles,  even  though  the  net  volume  concentrettion  of  solute  may  remain  quite  low. 
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the  short  diameter  of  the  tube,  making  contact  with  neighboring  tubes,  on  a  much  shorter  time 
scale.  The  Ryskin/Batchelor  viewpoint  appears  sufficiently  consistent  with  this  current  thought  on 
near-neighbor  interactions  in  a  hydrodynamically  concentrated  solution  and  seems  applicable  to  our 
situation,  therefore  we  follow  their  argument  for  the  local  increase  in  flow  viscosity.  As  in  Ryskin 
(1987),  we  begin  with  an  equivalent  form  of  Batchelor’s  (1971)  expression  for  the  viscosity  increase 
due  to  a  suspension  of  hydrodynamically  concentrated  rigid  rods  in  a  strain  rate  flow: 

=  .  +  (14) 

I/O  9  In  ^ 

where  i/q  is  the  kinematic  viscosity  of  the  solvent,  i/i  is  the  kinematic  viscosity  of  the  solution,  N  is 
the  number  density  of  polymers  per  volume,  and  $  is  the  hydrodynamically  effective  volume  concen¬ 
tration  based  on  a  cycle-averaged  polymer  deformation.  Ryskin  neglects  the  dependence  of  volume 
concentration  $  on  extension  ^  (as  we  do  for  highly  deformed  polymers  in  our  constant-polymer- 
volume  eissumption),  and  argues  that  increases  in  viscosity  are  significant  only  when  adjacent  poly¬ 
mers  are  hydrodynamically  interactive.  The  hydrodynaunically  effective  volume  concentration  of 
polymers  can  be  expressed  as: 


■i-(W 


where  is  the  maximum  stable  elongation  of  the  polymer  before  coil-stretch  transition  of  the 

polymer  begins,  marking  the  onset  of  supercritical  stretching.  The  length  ^  is  the  hydrodynsuni- 
cally  effective  radius  of  the  unstretched  polymer,  an  expression  derived  by  Rabin  et  al  (1985).  This 
effective  concentration  can  also  be  approximated  using  Einstein’s  result: 

(16) 

where  C  is  the  concentration  of  polymers  by  weight  and  [?;]  is  the  intrinsic  viscosity.^  Substituting 
the  effective  concentration  equations  15  and  16  into  the  viscosity  equation  14  yields  an  expression 
for  the  increase  in  viscosity: 


=  1  + 


1.555CM  f  (max  effective 


where  ^max  effective  is  the  effective  maximum  elongation  (in  a  cycle-averaged  sense)  of  the  polymer  in 
the  flow.  From  our  polymer  model,  the  maximum  stable  elongation  of  a  polymer  of  spring  constant 
C  =  5  in  a  streamwise  strain  rate  flow  is  roughly  ~  4.  We  assume  that  the  effective  maximum 
elongation  in  a  strain  rate  flow  is  half  the  maximum  extension  when  flipping  and  entangling;  this  is 
roughly  ,  or  ^  (from  Figures  7  and  8).  Typical  polymer  concentration  emd  intrinsic  viscosity 
values,  such  as  in  the  sink  flow  experiment  of  James  and  Saringer  (1980),  yield: 


C[,l  »  (0,00CKI2j^)  (2500i^) 
«  0.05 

i/1  ,  1.555  0.05 


-  «  M  (18c) 

^0  ln||f  \4J 

«  1.8.  (18d) 

^Intrinsic  viscosity  is  defined  as  [tj]  =  limc_o  .  where  tjq  denotes  the  shear  viscosity  of  the  solvent  and  tjj 

denotes  the  shear  viscosity  of  the  solution. 
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Thus  it  seems  that  for  the  deformations  {mac  ~  60  that  we  observe  for  polymers  embedded  in  a 
typical  streamwise  dependent  vortical  flow  field,  the  local  viscosity  may  be  almost  doubled. 

What  effect  does  the  increase  in  local  viscosity  have  on  the  velocity  profiles?  The  rigorous  way 
to  address  this  question  would  be  to  adapt  our  fluid  equations  of  motion  to  account  for  variable 
viscosity  during  the  flow-evolution  calculations.  We  do  not  pursue  this  because  it  is  impractical  to 
fill  our  computational  domain  with  polymer  molecules,  auid  because  the  significant  memory  effects 
of  the  polymer  seem  to  thwart  attempts  to  characterize  the  local  deformation  of  polymers  (and  thus 
the  local  increase  in  viscosity)  by  using  local  flow  quantities.  Therefore  our  first  order  answer  as  to 
the  effect  on  the  velocity  profiles  is  to  say  that,  since  the  prominent  energy  dissipation  mechanisms 
of  polymers  are  likely  to  be  active  both  as  they  are  being  significantly  stretched  and  as  they  are 
interacting  when  collapsing  and  buckling  during  the  flipping  cycle,  therefore  the  viscosity  will  be 
approximately  doubled,  per  our  above  calculation,  over  the  core  of  the  vortex  (since  polymers  seem 
to  be  activated  principally  at  the  edge  of  the  vortex  core,  and  remain  active  for  some  period  there¬ 
after  in  regions  over  which  the  core  edge  has  already  swept).  However,  we  must  be  a  little  cautious 
here.  If  the  local  viscosity  is  doubled,  then  the  local  vortex  Reynolds  number  Ry  is  halved,  decreas¬ 
ing  the  streamwise  strain  rate,  thereby  decreasing  the  deformation  of  embedded  polymers,  thereby 
decreasing  the  local  viscosity,  etc.  Obviously,  a  rigorous  treatment  would  account  for  this  feedback 
mechanism;  omitting  the  feedback  from  calculations  means  that  our  observed  polymer  deformations 
and  inferred  viscosity  increase  can  only  be  considered  an  upper  bound  to  the  true  rheological  effect. 
As  an  extreme  case,  one  could  even  guess  that  the  tiniest  increase  in  viscosity  decreases  the  stream- 
wise  strain  rate  enough  so  that  supercritical  stretching  of  polymers  is  completely  halted,  meaning 
that  the  polymer  additives  have  practically  no  effect  on  the  flow.  However,  this  extreme  suggestion 
is  ruled  out  by  our  earlier  sensitivity  analysis.  Recall  from  Section  4  that  for  our  typical  initially- 
potential  vortices,  nearby  polymers  undergo  about  the  same  magnitude  of  supercritical  stretching 
for  vortices  of  strength  from  >  3  up  to  at  least  6.5.  Therefore  concentrated  vortices  of  strength 
Ry  =  6.5  (among  the  most  destabilizing  vortices  found  in  a  boundary  layer  flow)  can  double  the 
viscosity  due  to  polymer  activation  with  effects  that  are  generally  undiminished  by  this  feedback 
mechanism.  However,  similar  vortices  which  begin  at  weaker,  strengths,  or  more  diffuse  vortices 
whose  threshold  strength  seems  to  be  on  the  order  of  «  5  (bWh  of  which  are  less  destabilizing  in 
a  boundary  layer  flow),  may  induce  effects  that  are  limited  -  to  some  extent  not  determined  here  - 
by  feedback.  Therefore,  it  appears  that  in  streamwise  vortical  flows,  polymer  additives  can  reduce 
the  vortex  Reynolds  number  of  at  least  the  most  destabilizing  vortices  by  half.  This  significantly 
reduces  the  inflectional  profiles  within  the  vortex  core. 

We  now  have  a  fairly  complete  chain  of  evidence  for  turbulence  suppression  and  turbulent  drag 
reduction  by  dilute  polymer  additives.  We  are  led  to  surmise  that,  in  accordance  with  the  stability 
calculations  of  Pearson  (1985)  and  in  analogy  to  inviscid  stability  theory,  the  inflectional  velocity 
profiles  at  the  vortex  center  associated  with  vortices  of  strength  /?„  ~  6.5  are  unstable.  The  flow 
in  the  vicinity  of  a  vortex  experiences  a  local  increase  in  viscosity  due  to  polymer  deformation. 
We  infer  from  this  the  local  decrease  in  strength  of  the  associated  vortex  (to  Ry  ~  3)  via  polymer 
deformation,  and  thus  the  local  decrease  in  strength  of  the  associated  vortex,  the  reduced  inflectional 
velocity  profiles  within  the  vortex  core,  and  the  increased  stability  of  the  flow  within  the  region  of 
the  vortex. 
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Abstract.  We  propose  a  new  iterative  numerical  scheme  designed  for  massively  par¬ 
allel  processing  for  an  immiscible  displacement  in  a  naturally  fractured  reservoir.  The 
procedure  is  based  on  a  domain  decomposition  technique  applied  to  a  mixed  finite  ele¬ 
ment  approximation  of  the  problem;  the  domain  is  decomposed  into  individual  elements. 
Numerical  experiments  are  presented  to  illustrate  its  performance  on  a  CM-5  system. 


1  introduction 

High  quality  numerical  simulations  of  fluid  flow  in  petroleum  reservoirs  require  the 
use  of  increasingly  finer  grids  in  the  numerical  discretization  of  the  governing  system 
of  partial  differential  equations  so  that  a  large  number  of  length  scales  relevant  to 
the  problem  can  be  incorporated  into  the  simulations.  This  problem  is  critical  when 
inhomogeneities  are  present  and  their  influence  need  to  be  adequately  resolved. 

Detailed  two-dimensional  studies  of  the  effect  of  the  inhomogeneities  of  a  single 
porosity  medium  have  reached  the  limits  of  existing  serial  computers  [22],  [23].  For 
dual  porosity  models,  meaningful  studies  are  infeasible  on  serial  mauJiines. 

In  this  paper,  a  parallel  iterative  procedure,  specially  designed  for  massively  parallel 
processing,  is  proposed  for  the  numerical  solution  of  dual-porosity  models  for  immiscible 
flow  in  a  naturzdly  fractured  reservoir.  Two  implementations  were  performed,  one 
completely  portable,  adequate  for  MIMD  systems,  and  the  other  using  a  data-parallel 
programing  language  particular  to  a  Connection  Machine  Model  CM-5  using  SIMD 
control.  For  fluid  flow  simulations  with  grid  sizes  relevant  for  applications  we  find  a 
good  scalability  of  our  algorithm,  with  a  consistent  slightly  better  performance  of  the 
MIMD  version.  However,  through  the  use  of  vector  processing  units,  the  SIMD  code 
runs  faster. 

The  model  problem  treated  in  this  paper  corresponds  physically  to  a  waterflooding 
of  a  naturally  fractured  petroleum  reservoir  where  the  average  spacing  between  frac¬ 
tures  is  relatively  small  compared  to  the  reservoir  size.  With  the  terminology  adopted 
in  previous  works  the  particular  model  treated  herein  is  known  as  the  “medium  block 
model”  [16].  The  system  of  partial  differential  equations  governing  fluid  flow  in  this 
double  porosity  formulation  treats  the  flow  in  each  matrix  block  in  a  completely  par- 
allelizable  fashion.  The  part  of  the  system  describing  the  flow  in  the  fractures  is  then 
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aumerically  approximated  by  a  hybridized  mixed  finite  element  method.  A  domain  de¬ 
composition  technique  in  which  the  domain  is  decomposed  into  individual  elements  is 
then  applied.  This  allows  us  to  adapt  the  solution  of  the  problem  to  massively  parallel 
processing.  Domain  decomposition  techniques  distinct  from  the  one  used  here  can  be 
found  in  [7],  [24],  [25],  [19],  and  [18].  As  our  first  step  towards  full  three-dimensional 
fluid  flow  simulations,  we  consider  in  the  numerical  studies  described  in  this  work  two- 
dimensional  fractured  media  to  which  are  attached  three-dimensional  matrix  blocks. 

We  define  a  nonlinear  iterative  procedure  and  use  it  to  solve  numerically  the  part  of 
the  system  describing  flow  in  the  fractures  which  is  coupled  to  the  system  of  equations 
for  the  matrix  blocks  through  source  terms.  This  method  is  motivated  by  the  linear 
problem  analyzed  in  [17],  which  is  closely  related  to  the  one  introduced  in  [9]  for  a 
Helmholz  problem  and  extended  to  another  Helmholz-like  problem  related  to  Maxwell’s 
equations  [8],  [10].  As  in  the  above  references,  we  shall  make  use  of  the  hybridization  of 
mixed  finite  element  methods  introduced  in  [21]  and  [20]  more  than  twenty-five  years 
ago  and  which  has  been  carefully  analyzed  in  [1];  see  also  [4],  [2],  and  [3].  For  the 
numerical  solution  of  the  local  problems  associated  with  the  matrix  blocks,  a  simple 
finite  difference  scheme  [16]  will  be  used.  A  rigorous  proof  of  convergence  of  the  iterative 
procedure  is  currently  being  investigated  by  the  authors.  For  the  simpler  problem  of 
two-phase  flow  through  a  single  porosity  medium,  convergence  of  the  iteration  has  been 
established. 

This  paper  is  organized  in  the  following  way.  In  §2  a  brief  description  of  the  model 
considered  here,  along  with  a  time  discretization  for  it,  is  given.  A  domain  decom¬ 
position  technique  and  the  new  iterative  procedure  defined  for  a  mixed  finite  element 
approximation  of  the  system  of  equations  in  the  fractures  appear  in  §3.  A  detailed  de¬ 
scription  of  the  time-dependent  algorithm  developed  to  solve  the  full  governing  system 
appears  in  §4.  Distinct  parallel  implementations  of  our  numerical  method  in  a  CM-5 
system  are  analyzed  in  §5.  Finally,  §6  is  devoted  to  our  conclusions  and  interesting 
open  problems  related  to  this  work. 

2  THE  MODEL  PROBLEM 
2.1  Governing  Equations 

We  consider  saturated,  two-phase,  incompressible,  immiscible  flow,  the  phases  being  o 
(oil  or  nonwetting  phase)  and  w  (water  or  wetting  phase),  with  densities  and  viscosities 
Pa  and  Pa,  01  =  o,w,  respectively.  See  [15]  in  this  volume  for  a  description  of  the 
system  of  equations  governing  fluid  flow  in  a  single  porosity  model  under  the  above 
assumptions. 

The  governing  system  for  the  medium  block  model  is  derived  through  the  mathe¬ 
matical  theory  of  homogenization  [12].  It  produces  a  two-phase,  single-porosity  model 
for  the  flow  in  the  matrix  system  and  a  second,  slightly  modified  single-porosity  system 
in  the  fractures.  To  reduce  the  number  of  subscripts  in  the  notation,  we  use  capital 
letters  to  indicate  quantities  in  the  fractures  and  small  letters  to  indicate  those  in  the 
matrix  blocks. 

Let  Da  denote  the  block  attached  to  the  point  x  €  D;  the  m-saturation  in  D*  will 
be  indicated  by  s(x,  y,  t),  x  €  D,  y  €  D*,  t>Q,  etc. 

The  capillary  pressure  and  relative  permeability  functions  are  somewhat  different 
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in  the  fractures  than  in  the  matrix  blocks.  Generally,  one  assumes  that  the  fractures 
are  essentially  like  spaces  between  two  parallel  planes  and  that  Smin  =  0  and  5m«v  =  1. 
The  singularity  in  the  capillary  pressure  curve  as  S  decreases  to  5n,in  =  0  is  weaker 
than  that  for  the  capillary  pressure  function  in  the  blocks,  and  the  relative  permeability 
functions  can  be  taken  to  be  linear  or  nearly  linear.  The  absolute  permeability  tensor 
on  the  fracture  sheet  reflects  the  geometry  of  the  blocks  [11]. 

The  source  terms  in  the  saturation  and  pressure  equations  in  the  fractures  contain 
two  terms,  one  defining  the  external  flow  (wells  in  practice).  In  addition,  there  are 
matrix  source  terms  Qm,  a,  oi  =  o,  w,  for  each  of  the  phases.  The  system  governing  flow 
in  the  fracture  system  can  be  written  as 

Qext,w  "I"  Qm,w  for  ^  ^  t  ^  0,  (2. In) 

— A,t,(5)V'^u,  for  X  e  t  >  0,  (2.16) 

9e*t,o  +  Qm,o  iOTXe  Cl,  t>  0,  (2.1c) 

-Ao(5)V^o  for  X  6  n,  t  >  0,  (2. Id) 

Pc\^c  +  ipo-  Pw)9z),  (2.1e) 

Incompressibility  requires  that  qm,o  +  Qm,w  =  0. 

It  is  convenient  to  write  the  equations  on  the  block  fix  as 

=  0  for  y  €  fix,  t  >  0,  (2.2a) 

=  0  for  y  G  fix,  t  >  0,  (2.26) 

=  +  {po  -  Pw)9z);  (2.2c) 

we  have  assumed  that  the  external  sources  affect  the  fracture  system  only.  The  bound¬ 
ary  conditions  for  the  matrix  problems  are  given  by  requiring  continuity  of  the  poten¬ 
tials: 


-V  ■  [X{sWw  +  A„(s)V^o] 


Qr.  = 


Qo 

s 


and 


i’w{x,y,t)  =  ^„,(x,t)  for  y  G  9fix,  X  G  fi,  t  >  0, 
=  '^c{x,t)  for  y  G  5fix,  x  G  fi,  t  >  0. 


(2.3a) 

(2.36) 


The  matrix  source  terms  are  defined  as  follows.  The  volume  of  the  w-fluid  leaving  the 
block  fix  is 

/  •  n  da(y)  =  /  V  •  dy  =  -  /  </>—  dy; 

JdUx  Jill  Jitx  ot 


consequently,  let 


9m,ui(^,  ^)  —  l(-j  if  ^  cu  ^  ^  ^  ^ 


lfi 


(2.4) 


We  complete  the  model  by  specifying  the  external  boundary  conditions  and  the 
initial  conditions  for  the  system.  For  the  case  of  no  fiow  across  the  external  boundary, 


Aq(s)  V  'i'a  •  n  =  0  for  X  G  dCl,  t  >  0,  a  —  o,w. 
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Initial  saturations  (i.e.,  capillary  potentials)  must  be  specified: 

^c(3J,0)  =  ^init,c(^)  a:  €  D, 

V’c(a:,  y,  0)  =  V'init.c(^»  v)  y  €  Dx,  a:  6  D. 

To  be  consistent,  (2.3)  and  (2.5)  should  hold  when  t  =  0. 

2.2  Time-Discretization 

Discretize  the  time  variable  by  choosing  such  that  0  =  t°  <  < 

. . .  <  ,  and  set  A<"  =  An  approximation  to  a  function  0  related  to  the 

fracture  system  at  a  point  a:  €  D  at  time  will  be  denoted  by 

0"  «  0(x,r). 

Approximate  (2.1)  implicitly  by  backwards  Euler  approximations  in  time  to  obtain 
the  system 


5"  -  5”'^ 


$ - - - 

At^ 

+  V  ■  <31  - 

for  X  e  D,  t>  0, 

(2.6a) 

Cn  _  gn-l 

<31  =  -A„(S”)V4'1 

for  X  6  D,  t>  0, 

(2.66) 

$ - - - 

At" 

+  V  •  „  for  x€Q,  t>  0, 

(2.6c) 

<31  =  -A.(S")V*1 

for  X  €  D,  t>  0, 

(2.6d) 

5"  = 

-  pw)gz), 

(2.6e) 

The  time-discretization  of  the  equations  describing  the  flow  in  the  matrix  blocks 
will  be  discussed  in  the  context  of  a  finite  difference  discretization  of  the  matrix  system 
in  §4. 

3  DOMAIN  DECOMPOSITION  FOR  THE  FRACTURE  SYSTEM 

Parallelization  of  the  solution  of  the  global  fracture  system  problem  is  achieved  through 
a  spatial  decomposition,  which  we  now  describe. 

Let  D  C  d  =  2  or  3,  be  a  bounded  domain  with  a  Lipschitz  boundary  dQ.  Let 
{Dj,  j  =  1,. M}  be  a  partition  of  fl: 

n  =  n  Dfc  =  0,  k. 

Assume  that  dClj,  j  =  is  also  Lipschitz  and  that  Dj  is  star-shaped.  In 

practice,  with  the  exception  of  perhaps  a  few  Uj's  along  dU,  each  Qj  would  be  convex 
with  a  piecewise-smooth  boundary.  Let 

r  =  5f2,  Pj  =  r  n  dQj,  Fjk  =  r*,  =  dQj  n  dQk- 

Let  us  consider  decomposing  (2.6)  over  the  partition  {Dj}.  In  addition  to  requiring 
{5”},  ^  =  ty,o,  j  =  l,...,Af,  to  be  a  solution  of  (2.6)  for  x  e  fij, 

j  =  1, . . . ,  M  it  is  necessary  to  impose  the  consistency  conditions 

X  eTjk,  a  =  w,o, 

QaJ  '  4"  Qa,k  *  —  Oj  X  €  OC  =  W,  0, 

where  Uj  is  the  unit  outer  normal  to  Qj. 

332 


(3.1a) 

(3.16) 


A  Massively  Parallel  Iterative  Numerical  Algorithm  for  Immiscible 
Flow  in  Naturally  Fractured  Reservoirs 


3.1  Weak  Formulation 


Let  Vj  =  H{div,ilj)  and  Wj  =  for  j  =  The  weak  formulation  of 

(2.6)  with  the  domain  decomposed  according  to  the  discussion  above  is  given  by  seeking 
{S^,  Qlj,  eWjxVjXVjXWjxWjJ^l,...,  M,  such  that 


=  +  {qm,wy'U>lhj, 

{Pc{S^),W3){i.  =  {^l^,W3)sij  +  ({po  -  Pw)9Z,W3hi, 


(3.2a) 

(3.26) 

(3.2c) 

(3.2d) 

(3.2e) 


where  vi,V2  €  Vj  and  wi,W2,‘U}3  e  Wj.  There  is  a  technical  difficulty  with  (3.26)  and 
(3.2d);  the  meaning  of  the  restriction  of  an  L^-function  on  fl*  to  Fj*  is  not  clear. 
Thus,  (3.2)  is  properly  viewed  as  motivation  for  the  treatment  of  the  discrete  case  to 
be  discussed  below. 


3.2  Mixed  Finite  Element  Approximation 

We  shall  treat  the  case  in  which  {flj}  is  a  partition  of  fl  into  individual  elements 
(simplices,  rectangles,  prisms),  though  an  inspection  of  the  procedure  would  indicate 
that  larger  subdomains  are  permissible.  Let  x  be  a  mixed  finite  element  space 
over  {flj};  any  of  the  usual  choices  is  acceptable;  [4],  [2],  [3],  [6],  [26],  [28],  [29].  Each 
of  these  spaces  is  defined  through  local  spaces  Vj*  x  —  V{Qj)  x  W{Qj),  and  setting 

=  {t;€if(div,n):v|n,  €F/}, 

=  {w  :  wjn,  €  Wj). 

In  each  space  in  the  various  families  of  mixed  elements  referenced  above,  the 

functions  w  e  are  allowed  to  be  discontinuous  across  each  As  a  consequence, 

attempting  to  impose  the  consistency  conditions  (3.1)  would  force  a  fiux  conservation 
error;  i.e.,  (3.16)  would  not  be  satisfied  unless  the  approximate  solution  €  W^,  a  = 
w,  o,  to  the  discrete  analogue  of  (3.16)  is  constant,  a  totally  uninteresting  case.  So,  let  us 
introduce  Lagrange  multipliers  [21],  [20],  [1]  on  the  edges  {Fjifc}.  In  the  discussion  below 
we  consider  the  parameter  a  to  be  either  o  or  w.  Assume  that,  when  Qqj  =  Qaj  hj , 
Qa,j  ^  normal  component  Qaj  ■  ^j  on  Fj*,  is  a  polynomial  of  some  fixed  degree 

Tq,  where  for  simplicity  we  shall  assume  Tq  independent  of  Fj*  (see  [5]  if  not).  Set 

Aq  =  {Aa  :  AqIp^j^  €  Fr„(Fjfc)  =  AQjfc,Fjfe  ^  0}; 

note  that  there  are  two  copies  of  Pr.  assigned  to  the  set  F^fe:  h.a,jk  and  h-a^kj- 
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The  hybridized  mixed  finite  element  method  is  given  by  dropping  the  superscript  h 
and  seeking 

{5"  €  W,,Ql^  e  £  tVj.AV,  e 

where  j  =  1, . . . ,  M;  A:  =  1, . . . ,  Af;  a  =  w,o,  such  that 


\Au)(Oj  )  '  iij  ic 

=  (C,o.^2)n,  +  (9m,o.«^2)n,, 

‘‘*'^^2)0,  +  'EiK,jk^'^2  ■  I'Wj,  =  0, 

\l\o\^j  )  /  flj  J(j 

(Pc(5’*),  Wshj  =  ('J'cj.  "^3)0,  +  {{po  -  Pw)9Z,  Wihj, 


(3.3a) 

(3.36) 

(3.3c) 

(3.3d) 

(3.3e) 


where  Vi,V2  €  Vj  and  Wi,  W2,  W3  €  Wj. 


3.3  The  Iterative  Method 

In  order  to  define  an  iterative  method  for  solving  the  above  system  [9],  [8]  it  is  conve¬ 
nient  to  replace  (3.36)  and  (3.3d)  by  the  Robin  transmission  boundary  condition 

-  PQaJ  ■  +  ^a,j  =  PQa,k  '  l^k  +  X  £  T jk  C  dQj,  O  =  W,  0,  (3.4a) 

-PQa,k  •  I^k  +  ^a,k  =  l3Qo.,j  •  x^TkjC  d^k,  a  =  w,o,  (3.46) 

where  f3  is  a,  positive  (normally  chosen  to  be  a  constant)  function  on  UTjk- 

Now  we  formulate  an  iterative  version  of  the  finite  element  approximation  of  (3.3) 
with  consistency  conditions  given  by  (3.4).  Consider  the  Lagrange  multiplier  to  be 
Aajifc,  a  =  w,o  as  seen  from  Qj  and  Xa,kj,  «  =  w,o  as  seen  from  Qk-  Then,  modify 
(3.4)  to  read 

~PQa,j  ■  4"  ^a,jk  ~  PQa,k  '  ^k  "I"  Xa^kji  ^  ^  Pjfc  C  d^j,  Ck  =  VJ,  O, 

PQa,k  ■  ^k  4"  ^a,kj  ~  PQa,j  '  4"  Xgijki  ^  ^  ^ kj  ^  Q!  —  W,  O, 


SO  that 


{Xajkj  V  ■  —  {P{Qa,j  '  d"  Qa,k  '  ^fc)  d"  Xa,kji  ^  ‘  Oc  —  W,  O. 

The  objective  of  a  domain  decomposition  iterative  method  is  to  localize  the  cal¬ 
culations  to  problems  over  smaller  domains  than  Q.  Here,  it  is  feasible  to  localize  to 
each  Qj  by  evaluating  the  quantities  in  (3.3)  related  to  ilj  at  the  new  iterate  level  and 
those  in  (3.3)  related  to  neighboring  subdomains  Q*  such  that  Tjk  ^  0  at  the  previous 
iteration  level.  Specifically,  the  algorithm  would  be  as  follows;  let,  for  all  j  and  k. 


s? 


-1 


e  e  Vj, 


^n-l 


ew, 


V 


\n-l  c  A 


K,k]  €  K,kj,  a  =  w,o, 
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seems  natural)  be  the  solution  of  the  discretized  system  of  equations  at 
some  discrete  time  (we  introduce  in  the  notation  the  superscript  i  which  is  an  iteration 
counter). 

Then  the  solution  propagated  by  one  time  step  is  given  as  the  limit  as  i  -♦  oo  of 
recursive  solutions  of  the  equations 


—  (9exs,u;’ ^Orij  (9m,u'» 

)  /  Slj  ^ 

=  “  -t- 

=  (9«!.o.^2)nj  +  (C.o.«'2)n,, 

ViVoPj  J  k 

=  -  ^{0Qofk~^  •  ^k  +  ^2  ' 


(3.5a) 


(3.5b) 


(3.5c) 


(3.5d) 


The  Lagrange  multipliers  are  updated  according  to 

AS  = 

AS  =  +  + 

and  finally  the  equation  for  the  capillary  pressure  is  linearized: 

(— 9S— 

_  (.fjj  _  +  ((p„  -  p^.)gz,W3)n,  -  (/’c{S"‘“'),W3)n,- 


(3.5e) 

(3.a) 


{3.5g) 


We  still  have  to  explain  how  the  matrix  source  terms  are  incorporated  into  the 
iterative  procedure.  We  will  postpone  this  discussion  to  §4. 

We  have  been  able  to  prove  the  following  theorem  concerning  the  convergence  of 
the  iterative  procedure  defined  above  in  the  simplified  context  where  matrix  blocks  are 
suppressed  from  the  model. 

Theorem  3.1  (Convergence  of  the  Iterative  Procedure)  Suppose  that  a  smooth 
solution  of  the  system  (2.1)  exists.  Then,  there  exists  a  constant  t*  such  that,  when 
At  <  t*, 

1)  the  iterative  scheme  (3.5)  converges:  i.e.,  there  exists 


J-  Ki.jk  ^  } 
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such  that 

IIS?'’  -  s?ii  +  iw:'‘  -  q;ji  +  iKj  -  «;jii  +  iiAjj.  -  a:j*ii  -  o 

as  i  —*  cx>;  moreover,  =  Xa,kj>  ^ 

2)  the  above  limit  converges  to  the  smooth  solution  in  the  sense  there  is  a  constant  c 
such  that 


iis(i”)  -  S’!!  +  iiQ,(«")  -  «:ii  +  ii*.(<")  -  »:ii  <  + A), 

where  0"(x)  =  B”(x),  for  x  G  Q,j  and  h  represents  the  partition  ofQ. 

A  rigorous  proof  of  the  above  statement  will  appear  elsewhere. 

4  THE  COMPUTATIONAL  ALGORITHM 

Our  numerical  procedure  will  combine  a  computationally  inexpensive  finite  difiFerence 
procedure  to  solve  the  local  problems  associated  with  each  matrix  block  with  a  hy¬ 
bridized  mixed  finite  element  method  applied  to  the  global  fracture  system  problem. 
The  fracture  and  matrix  systems  cannot  be  handled  sequentially,  since  a  small  change 
in  I  'le  boundary  values  on  each  matrix  block  can  cause  flow  of  a  volume  of  fluid  that 
IS  large  in  comparison  to  the  volume  of  the  fractures.  The  matrix-fracture  interaction 
for  the  medium  block  model  can  be  handled  implicitly  by  a  linearization  of  the  matrix 
i)roljh‘m.s  to  be  made  precise  below.  The  final  procedure  requires  solution  of  many 
small  linear  .systems,  each  corresponding  to  an  element  of  the  discretized  fracture  sys¬ 
tem  and  its  associated  matrix  block.  The  solution  of  these  small  and  uncoupled  linear 
systems  can  be  handled  easily  by  a  parallel  machine. 

Discretize  the  space  variables  by  defining  grids  over  and  over  each  matrix  block  Qx- 
We  consider  Q  and  x  £  Q,  to  be  rectangular  parallelepipeds;  more  general  domains 
can  be  treated  by  either  finite  difference  or  finite  element  techniques  quite  analogous 
to  the  methods  to  be  described  herein.  Suppose  that  D  =  [0,  £?i]  x  [0,  D2]  x  [0,  £>3]. 
Then,  divide  each  Dj  into  Nj  intervals,  which  for  simplicity  we  take  to  be  of  equal  size 
H]  =  Dj/Nj,  j  =  1, 2, 3.  Thus,  the  set 

4//  =  {xl  =  {LiHi+Hi/2,  L2H2+H2/2,  L3H3+H3/2)  :  Lj  =  0, 1, . . . ,  —  =  1, 2, 3}, 


consists  of  the  centers  of  the  elements  of  the  mixed  method. 

Again  for  notational  convenience  assume  that  the  matrix  blocks  are  all  of  the  same 
size  and  consider  a  grid  defined  on  the  matrix  block  Qx  which  will  be  used  in  a  finite 
difference  discretization  of  the  matrix  equations  to  be  discussed  below.  Let  hj  and  nj 
be  analogous  to  Hj  and  Nj  and  set 


Gm  —  {yt  —  (^1^1,  ^2^2?  ^3^3)  ■  =  0, 1, 2, . . 


Uj,  j  —  1, 2, 3}. 


Also,  let 


=  {ye  =  ^2^2>  4^3)  '■  ij  —  1,2, ..  .,nj  —  I,  j 


1,2,3} 
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indicate  the  interior  nodes  and  dGm  —Qm\%n  the  boundary  nodes.  (Advantage  should 
be  taken  of  any  symmetry  of  the  solution  on  a  matrix  block  to  allow  the  solution  to  be 
computed  only  at  necessary  nodes.) 

An  approximation  to  a  function  6  related  to  the  the  fracture  system  at  a  point 
xi^Qf  will  be  denoted  by 

02  «  eCxi.t"). 

and  for  a  function  6  associated  with  the  block  at  the  point  xi  e  Qf,  denote  the 
approximation  to  6  aX  yi€  Qm  by 


where  =  xi-yt  (this  places  a  top  corner  of  the  block  at  xl). 

The  matrix  equations  will  be  completely  linearized,  but  not  the  fracture  equations. 
The  discrete  matrix  system  is  directly  solvable.  The  four  parts  of  the  algorithm  be¬ 
low  uncouple  the  calculations  related  to  the  matrix  blocks  from  those  of  the  fracture 
calculation: 

i)  Initialization.  For  each  L  and  set 

=  ^init.e(^i)-  {Kl  +  (Po  -  Pw)9  zixi))  , 

Klj  =  ^init.c(^i.<)’  =  Pc  ^  +  (Po  -  P^)9  z{xi,t))  . 

ii)  Matrix  system.  For  each  I,  £,  and  for  n  >  1,  find  W  solving 


=0  it  W  € 

=0  if !(,  e  (4.14) 
=  “d  =  ifwe«;m,  (4.1c) 

and  determine  and  {^”2,,/,  solving 


4>(xt,,)  •  [a.(»"-')V«x<]  =0  if  !«  6  Im,(4.2a) 

•  [A(s"-')Vft,t,,l(;:  +  A,(s"-')V4,t,,^;]  =0  if  j,  €  I„,  (4.24) 
^c,L,i  =  1  and  =  0  if  y#  €  dQm,  (4.2c) 


-Vh,u  •  =0  if  y,  €  Jm, 

0  and  i'Cx./  =  ^  if  y^  ^  ^m,  (4.3o) 
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where 


These  equations  are  linear,  since  the  mobilities  and  p|.  are  evaluated  at  the  previous 
time  level.  The  matrix  potentials  and  f,  <  are  defined  below  in  (4.7)  and 
(4.8);  they  satisfy  the  expected  equations,  namely  (4.10).  Equations  (4.1  define  a 
particular  solution  to  the  linear  equations,  while  (4)  and  (4.3)  give  solutions  to  the 
homogeneous  problems  which  describe  unit  changes  in  the  boundary  conditions. 

iii)  The  Iterative  Procedure. 

a)  The  matrix  source  term.  For  each  L  and  n  >  1,  compute 


c: 


,n.» 


= €.l.i + (»?,i  -  m') 

’  Pj'  +  (P.  -  Pm,)94^l.i))  , 


n,i 

"m,w,L 


in. 


t 


^L,t  ~  ^L,e 

At” 


n—1 


Vi, 


(4.4) 


where  V}  is  the  volume  element  associated  with  the  grid  point  i.  The  quantity 
Qm,w,L  given  implicitly  in  terms  of  the  fracture  potentials  at  the  nth  time  level; 
however,  in  view  of  (4.7)  and  (4.9)  below,  (4.4)  is  clearly  a  discretization  of  (2.4). 


b)  Fracture  System.  For  each  L  and  n  >  1,  solve  the  nonlinear  system  of  equations 
(3.5)  using  the  iterative  method  described  in  §3  for  S^,  Qa^i,  A”,  a  =  w,o 

by  computing  r  employing  iii)  above  (for  simplicity,  we  denote  the  set  of  four 
Lagrange  multipliers  associated  to  each  element  by  Aq).  The  no-flow  boundary 
conditions  of  (3.26)  and  (3. 2d)  are  imposed  by  considering  virtual  elements  outside 
the  computational  region  such  that 

Aa  ‘  =  AaiUe,’  a  =  w,o,  (4.5) 

Qa,L±ei  =0.  a  =  w,o,  (4.6) 

if  XL±ej  is  outside  the  reservoir. 

Since  physically  meaningful  capillary  pressures  are  nonnegative,  the  capillary  func¬ 
tions  should  be  extended  vertically  downward  at  S^ax  oi  Smax- 

iv)  Matrix  update.  For  each  L,  C,  and  n  >  1,  let 


-  «c”,z')  <i’iL + 

+ Kt  -  «ti')  -  K:i) 

W  (p.  “  Pi.)sp(*t/))  * 
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This  completes  the  time  step. 

The  above  algorithm  can  be  implemented  sequentially.  The  following  discrete  matrix 
problem  has  been  solved: 


=0  if  W  €  (4.10a) 

c]  =0  if  »,  6  I„,  (4.106) 

=  and  =  iiytedgm.  (4.10c) 


Assuming  that  the  wetting  fluid  is  the  denser,  it  should  be  noted  that  the  block  asso¬ 
ciated  with  the  fracture  point  xi  is  interpreted  to  lie  below  xi  for  imbibition,  the  case 
we  have  treated.  For  drainage  it  should  be  placed  above  xi\  otherwise,  fluid  is  trapped 
by  the  numerical  simulation  as  Pc  tends  to  zero. 

The  numerical  convergence  of  the  iterative  method  just  described  is  measured  in 
terms  of  a  relative  error  defined  in  terms  of  the  £2  norm  of  variables  describing  the  flow 
as  the  number  of  iterations  is  successively  doubled. 


5  PARALLEL  IMPLEMENTATIONS 

We  developed  a  serial  code  based  on  the  algorithm  described  in  the  previous  section 
and  validated  it  against  another  code  developed  independently  (which  uses  finite  dif¬ 
ferences)  in  [16].  In  order  to  minimize  execution  time  we  use  two  techniques  to  reduce 
the  number  of  iterations  required  for  convergence.  Different  procedures  are  required 
depending  on  the  time  step  number.  After  two  time  steps  have  been  solved,  a  quadratic 
extrapolation  in  time  [14],  [13]  reduces  drastically  the  number  of  iterations  required  for 
convergence.  A  different  procedure  has  to  be  adopted  for  the  two  initial  time  steps. 
We  used  a  variant  of  the  above  method.  Instead  of  extrapolating  in  time,  we  consider 
a  spatial  hierarchical  extrapolation.  We  solve  the  problem  in  a  family  of  nested  grids, 
interpolate  the  solution  according  to  the  finite  element  method  in  use  on  eL2h  grid,  and 
then  we  use  a  quadratic  extrapolation  as  a  function  of  the  grid  size.  Figure  1  illustrates 
this  procedure.  Using  the  hierarchical  extrapolation  we  typically  reduce  by  a  factor  of 
two  the  execution  time  for  the  initial  time  steps. 

We  consider  speedup  studies  through  simulations  of  waterflooding  calculations  in  a 
“five-spot”  geometry  with  gravity  effects  neglected. 

For  computational  simplicity,  the  fracture  calculations  are  two-dimensional  over  Q, 
though  the  matrix  calculations  would  remain  three-dimensional  over  each  if  gravity 
were  not  ignored.  Initially,  the  reservoir  contains  75%  oil  and  25%  water.  Water  is 
injected  uniformly  into  the  reservoir  along  one  corner  at  a  rate  of  one  pore-volume 
every  five  years. 
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The  following  data  are  held  fixed  for  the  computational  results  exhibited  below: 
Fluid  properties 

Viscosity  //„,  =  .5  cP  //<,  =  2  cP 

Density  =  1  g/cm^  Po  =  .7  g/cm^ 

Absolute  Permeabilities  K  =  1  darcy  k  =  0.05  darcy 

Porosities  $  =  .01  (f)  =  .1 

Residual  Saturations  (matrix)  Sro  =  15  Smi  =  -2 

Residual  Saturations  (fractures)  Sro  =  0  Srw  =  0 

The  capillary  pressure  functions  were  assumed  in  the  form 

Pe(5)  (1  -  S){y(s-^  - 1)  +  e}, 

Pc{s)  =  a((s  -  Sru,)"^  -  Pil  -  s)~^). 

So  ~  1  ^ro)  P  ~  ^tw)  i 

7  =  2.0  X  10^  dynes /cm 0  =  100 dynes /cm 
a  =  3.0  X  10^  dynes /cm 

The  relative  permeability  functions  in  the  fractures  were  chosen  to  be  linear,  with 
the  residual  saturations  taken  to  be  zero: 

KroiS)  =  1-5,  Kry,{S)  =  5. 

In  the  matrix  blocks  the  relative  permeabilities  functions  were  taken  to  be 

kro{s)  =  {1  -  (1  -  Sr„)‘*s}^ 

krwi^}  —  (1  ^rw)  ^rw)  • 

5.1  MIMD  Implementation 

The  MIMD  version  of  the  serial  code  described  in  the  previous  sections  is  implemented 
through  CMMD,  a  library  of  the  CM-5  and  uses  SPARC  microprocessors.  See  [27] 
for  additional  information  about  a  CM-5  system.  A  hostless  programming  model  is 
used,  in  which  each  node  receives  the  same  copy  of  a  code.  The  computer  code  is 
written  in  the  C  language  and  the  driver  of  the  program  is  written  in  terms  of  function 
pointers.  This  allows  us  to  assign  diff'^'^ent  functions  to  distinct  subdomains,  such 
that  subdomain  dependent  procedures  (like  injection  of  fluid  in  specified  positions  and 
imposition  of  boundary  conditions)  can  be  handled. 

The  computational  domain  is  decomposed  into  rectangular  regions.  Each  of  the 
subdomains  (which  in  general  will  contain  several  elements  of  the  discretized  fracture 
system  of  equations)  is  assigned  to  a  different  processor.  Each  processor  allocates  mem¬ 
ory  for  the  elements  contained  in  its  subdomain  and  for  a  buffer  zone  consisting  of  one 
layer  of  elements  outside  the  subdomain.  The  elements  contained  in  each  rectangular 
region  are  processed  sequentially,  using  the  algorithm  described  in  §3  with  a  modifica¬ 
tion  which  allows  exchange  of  information  between  nearest  neighbor  subdomains.  Once 
one  step  of  the  iterative  procedure  is  performed  on  each  element  within  a  rectangular 


340 


A  Massively  Parallel  Iterative  Numerical  Algorithm  for  Immiscible 
Flow  in  Naturally  Fractured  Reservoirs 


> 

I 


T — 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 

0  Ih  2h  3h  4h  5h  6h  7h  8h 

Grid  Size 


Figure  1:  The  hierarchical  extrapolation.  A  guess  for  the  iterative  method 
is  computed  using  an  extrapolation  of  solutions  of  a  given  problem  in  coarser 
grids  as  a  function  of  the  mesh  size.  Given  a  problem  on  a  grid  with  mesh 
size  h  the  guess  is  computed  using  approximate  solutions  for  the  the  same 
problem  on  grids  with  sizes  2h,  Ah,  and  8/i. 
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Figure  2:  The  ratio  of  the  time  spent  by  a  SUN  SPARC  station  to  finish  a 
simulation  divided  by  the  time  spent  by  a  partition  of  the  CM-5  (running 
the  MIMD  version  of  our  code)  with  variable  number  of  processors  is  plotted 
against  n,  the  number  of  elements  in  one  direction  of  the  grid.  The  physical 
size  of  the  reservoir  is  increased  with  n,  keeping  the  mesh  size  fixed.  For 
the  largest  problem  considered  the  speedup  obtained  is  about  half  of  the 
number  of  processors  used. 


region  then,  through  a  sequence  of  grid  shifts  (right,  left,  up  and  down),  data  on  the 
boundary  of  subdomains  is  sent  (received)  to  (from)  neighboring  subdomains.  The 
boundary  conditions  (4.5)  and  (4.6)  are  also  set  at  this  stage  of  the  computation. 

We  addressed  the  problem  of  the  speedup  obtained  with  the  CM-5  in  two  studies. 
First,  we  compared  execution  times  for  simulations  performed  in  the  CM-5  with  the 
same  simulations  performed  in  a  SUN  SPARC  station.  In  Figure  2  we  plot  the  ratio 
of  the  time  spent  by  a  SUN  SPARC  station  to  the  time  spent  by  different  partitions  of 
the  CM-5  as  a  function  of  the  problem  size  (represented  in  the  plot  by  the  number  n 
of  points  in  one  direction  of  the  grid),  with  mesh  size  kept  fixed.  Note  in  Figure  2  that 
for  the  largest  grid  considered  (256  x  256)  the  speedup  obtained  is  about  half  of  the 
number  of  processors  used.  Next,  we  considered  the  speedup  curve  for  simulations  with 
large  grids.  We  considered  the  ratio  of  the  time  spent  by  partitions  of  the  CM-5  with 
256  and  512  processors  (to  perform  a  family  of  simulations  with  increasing  physical 
size)  to  the  time  spent  by  128  processors  as  a  function  of  the  number  of  processors 
used.  The  result  of  this  study  is  reported  in  Figure  4.  Note  in  Figure  4  that,  as 
the  problem  size  is  increased,  the  closer  the  speedup  curve  gets  to  the  perfect  (linear) 
speedup. 

5.2  SIMD  Implementation 

SIMD  control  in  a  CM-5  system  is  achieved  through  the  notion  of  virtual  processors 
and  implemented  with  data-parallel  programing  languages.  Our  code  was  developed 

342 


A  Massively  Parallel  Iterative  Numerical  ALGORITH^.  for  Immiscible 
Flow  in  Naturally  Fractured  Reservoir. 


32  mroccsaon 
—  128  Proccuon 
**■  256  Prooettors 


Figure  3:  The  ratio  of  the  time  spent  by  a  SUN  SPARC  station  to  finish  a 
simulation  divided  by  the  time  spent  by  a  partition  of  the  CM-5  (running 
the  SIMD  version  of  our  code)  with  variable  number  of  processors  is  plotted 
against  n,  the  number  of  elements  in  one  direction  of  the  grid.  For  problems 
with  small  grids  increasing  the  number  of  processors  has  little  effect  on 
execution  time.  For  the  largest  problem  considered  the  speedup  obtained 
is  more  than  the  number  of  processors  used. 


using  the  language  C*.  Vector  processors  were  used  to  run  the  SIMD  version  of  our 
code. 

The  C*  program  is  quite  similar  to  serial  code.  We  used  grid  communication  within 
C*  to  perform  the  necesssary  exchange  of  information  and  the  “where”  statement  to 
set  boundary  conditions. 

Again,  we  considered  the  problem  of  the  speedup  obtained  with  the  CM-5  in  two 
studies.  An  study  analogous  to  the  one  described  in  Figure  2  for  the  MIMD  version 
of  our  code  is  the  content  of  Figure  3.  Note  in  Figure  3  that  as  n  is  increased  the 
performance  of  the  CM-5  increases.  The  speedup  obtained  for  the  largest  problem  size 
considered  (256  x  256)  in  this  study  is  more  than  the  number  of  vector  processors 
used.  Next,  we  considered  the  speedup  curve  for  simulations  with  large  grids.  This 
study  appears  in  Figure  4.  As  explained  above,  the  ratio  of  the  time  spent  by  partitions 
of  the  CM-5  with  256  and  512  processors  to  the  time  spent  by  128  processors  is  plotted 
against  the  number  of  processors  used.  As  we  noted  above  for  the  MIMD  version  of  our 
code,  as  the  problem  size  is  increased  the  closer  the  speedup  curve  gets  to  the  perfect 
(linear)  speedup. 

Figure  4  also  allows  us  to  compare  the  speedup  obtained  with  the  two  parallel 
implementations  reported  here.  Although  the  execution  times  of  the  SIMD  version  are 
about  half  of  the  MIMD  version  (due  to  the  use  of  vector  processing  units)  Figure  4 
shows  a  better  scalability  of  the  MIMD  version. 
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6  CONCLUSIONS  AND  OPEN  PROBLEMS 

We  proposed  a  new  numerical  method  to  solve  the  system  of  equations  governing 
immiscible  flow  in  naturally  fractured  reservoirs  in  massively  parallel  computers.  The 
numerical  method  uses  spatial  decomposition  in  the  context  of  an  iterative  procedure  to 
solve  the  global  problem  associated  with  the  part  of  the  system  governing  fracture  flow. 
By  decomposing  the  domain  into  the  elements  of  the  mixed  finite  element  method  used 
in  the  discretization  of  fracture  equations  the  numerical  scheme  can  be  easily  translated 
into  a  computer  code  written  in  data-parallel  languages. 

The  new  numerical  procedure  was  implemented  first  in  a  serial  machine,  and  nu¬ 
merical  simulations  were  performed  to  validate  the  code.  Then,  the  serial  code  was 
restructured  and  implemented  in  a  CM-5  system,  both  in  MIMD  and  SIMD  modes. 
We  found  a  good  scalability  of  the  two  versions  of  the  parallel  code  for  problems  with 
grid  sizes  relevant  in  applications. 

We  established  the  convergence  of  the  new  numerical  procedure  for  a  simplified 
version  of  the  model  discussed  here,  and  the  proof  of  convergence  for  the  full  system 
constitutes  an  interesting  open  problem.  Prom  the  numerical  point  of  view,  the  new 
numerical  method  will  allow  us  to  study  multi-length  scale,  stochastic,  double  porosity 
models,  due  to  the  high  resolution  provided  by  the  CM-5  through  the  use  of  fine 
computational  grids.  Obviously,  full  three-dimensional  fluid  flow  simulations  remain 
as  one  of  the  most  interesting  challenges  of  our  research  area;  such  simulations  are 
currently  being  pursued  by  the  authors. 

REFERENCES 

[1]  Arnold  D.  N.,  Brezzi  F.  Mixed  and  nonconforming  finite  element  methods:  imple¬ 
mentation,  postprocessing  and  error  estimates.  RAIRO,  19:7-32,  1985. 

[2]  Brezzi  F.,  Douglas  J.  Jr.,  Duran  R.,  Fortin  M.  Mixed  finite  elements  for  second 
order  elliptic  problems  in  three  variables.  Numer.  Math.,  51:237-250,  1987. 

[3]  Brezzi  F.,  Douglas  J.  Jr.,  Fortin  M.,  Marini  L.  D.  Efficient  rectangular  mixed  finite 
elements  in  two  and  three  space  variables.  R.A.I.R.O.  Modelisation  Mathematique 
et  Analyse  Numerique,  21:581-604,  1987. 

[4]  Brezzi  F.,  Douglas  J.  Jr.,  Marini  L.  D.  Two  families  of  mixed  finite  elements  for 
second  order  elliptic  problems.  Numerische  Mathematik,  47:217-235,  1985. 

[5]  Brezzi  F.,  Douglas  J.  Jr.,  Marini  L.  D.  Variable  degree  mixed  methods  for  second 
order  elliptic  problems.  Matemdtica  Aplicada  e  Computacional,  4:19-34,  1985. 

[6]  Chen  Z.,  Douglas  J.  Jr.  Prismatic  mixed  finite  elements  for  second  order  elliptic 
problems.  Calcolo,  26:135-148,  1989. 

[7]  Cowsar  L.  C.,  Wheeler  M.  F.  Parallel  domain  decomposition  method  for  mixed 
finite  elements  for  elliptic  partial  differential  equations.  In  Proceedings  of  the 
Fourth  International  Symposium  on  Domain  Decomposition  Methods  for  Partial 
Differential  Equations,  Philadelphia,  1991.  SIAM.  R.  Glowinski,  Y.  Kuznetsov,  G. 
Meurant,  J.  Periaux,  and  O.  B.  Widlund,  eds. 

345 


Jim  Douglas,  Jr.,  P.  J.  Paes  Leme,  Felipe  Pereira,  Li-Ming  Yeh 


(8)  Despr4s  B.  Domain  decomposition  method  and  the  Helmholz  problem,  pages  44-52. 
SIAM,  Philadelphia,  1991.  G.  Cohen,  L.  Halpern,  and  P.  Joly  (eds.). 

[9]  Despres  B.  Methodes  de  decomposition  de  domaines  pour  les  problemes  de  propa¬ 
gation  d’ondes  en  regime  harmonique.  PhD  thesis,  Universite  Paris  IX  Dauphine, 
UER  Mathematiques  de  la  Decision,  1991. 

[10]  Despres  B.,  Joly  P.,  Roberts  J.  E.  A  domain  decomposition  method  for  the  har¬ 
monic  Maxwell  equations,  pages  475-484.  Elsevier  Science  Publishers  B.  V.  (North- 
Holland),  Amsterdam,  1992.  R.  Beauwens  and  P.  de  Groen  (eds.). 

[11]  Douglas  J.,  Jr.,  Arbogast  T.  Dual  porosity  models  for  flow  in  naturally  fractured 
reservoirs.  In  Dynamics  of  Fluids  in  Hierarchical  Porous  Formations,  pages  177- 
221.  Academic  Press,  London,  1990.  J.  H.  Cushman,  ed. 

[12]  Douglas  J.  Jr.,  Arbogast  T.,  Paes  Leme  P.  J.,  Hensley  J.  L.,  Nunes  N.  P.  Immiscible 
displacement  in  vertically  fractured  reservoirs.  Transport  in  Porous  Media.  To 
appear,  1993. 

[13]  Douglas  J.  Jr.,  Dupont  T.,  Ewing  R.  E.  Incomplete  iteration  for  time-stepping  a 
nonlinear  parabolic  Galerkin  method.  SIAM  J.  Numer.  Anal.,  16:503-522,  1979. 

[14]  Douglas  J.  Jr.,  Dupont  T.,  Percell  P.  A  time-stepping  method  for  Galerkin  ap¬ 
proximations  for  nonlinear  parabolic  equations.  In  Numerical  Analysis,  Dundee 
1977,  volume  630  of  Lecture  Notes  in  Mathematics.  Springer- Verlag,  Berlin,  1978. 

[15]  Douglas  J.  Jr.,  Hensley  J.  H.,  Paes  Leme  P.  J.  A  study  of  the  effect  of  inhomo¬ 
geneities  on  immiscible  flow  in  naturally  fractured  reservoirs.  In  Porous  Media. 
Birkhauser,  Basel,  1993. 

[16]  Douglas  J.,  Jr.,  Hensley  J.  L.,  .Arbogast  T.  .4  dual-porosity  model  for  waterflood¬ 
ing  in  naturally  fractured  reservoirs.  Computer  Methods  in  Applied  Mechanics  and 
Engineering,  87:157-174.  1991. 

[17]  Douglas  J.  Jr.,  Paes  Leme  P.  J.,  Roberts  J.  E.,  Wang  J.  .4  parallel  iterative  pro¬ 
cedure  applicable  to  the  approximate  solution  of  second  order  partial  differential 
equations  by  mi.xed  finite  element  methods.  Numerische  Mathematik.  To  appear, 
1993. 

[18]  Ewing  R.  E.,  Wang  J.  Analysis  of  multilevel  decomposition  iterative  methods  for 
mi.xed  finite  element  methods.  R.A.I.R.O.  Modehsation  Mathematique  et  Analyse 
Numerique.  Submitted. 

[19]  Ewing  R.  E.,  Wang  J.  Analysis  of  the  Schwarz  algorithm  for  mixed  finite  element 
methods.  R.A.I.R.O.  Modehsation  Mathematique  et  Analyse  Numerique,  26:739- 
756,  1992. 

[20]  Fraeijs  de  Veubeke  B.  X.  Stress  function  approach.  International  Congress  on  the 
Finite  Element  Method  in  Structural  Mechanics,  Bournemouth,  1975. 

[21]  Fraeijs  de  Veubeke  B.  X.  Displacement  and  equilibrium  models  in  the  finite  element 
method.  In  Stress  Analysis.  John  Wiley,  New  York,  1965.  O.  C.  Zienkiewicz  and 
G.  Holister  (eds.). 


346 


A  Massively  Parallel  Iterative  Numerical  Algorithm  for  Immiscible 
Flow  in  Naturally  Fractured  Reservoirs 


[22]  Glimm  J.,  Lindquist  B.,  Pereira  F.,  Peierls  R.  The  fractal  hypothesis  and  anoma¬ 
lous  diflFusion.  Matemdtica  Aplicada  e  Computacional,  11:189-207,  1992. 

[23]  Glimm,  J.,  Lindquist  B.,  Pereira  F.,  Zhang  Q.  A  theory  of  macrodispersion  for 
the  scale  up  problem.  Advances  in  Water  Resources.  To  appear. 

[24]  Glowinski  R.,  Kinton  W.,  Wheeler  M.  F.  Acceleration  of  domain  decomposition 
algorithms  for  mixed  finite  elements  by  multi-level  methods.  In  Third  International 
Symposium  on  Domain  Decomposition  Methods  for  Partial  Differential  Equations, 
pages  263-290.  SIAM,  Philadelphia,  1990.  R.  Glowinski  (ed.). 

[25]  Glowinski  R.,  Wheeler  M.  F.  Domain  decomposition  and  mixed  finite  element 
methods  for  elliptic  problems,  pages  144-172.  SIAM,  Philadelphia,  1988.  R. 
Glowinski,  G.  Golub,  G.  Meurant,  and  J.  Periaux  (eds.). 

[26]  Nedelec  J.  C.  Mixed  finite  elements  in  R^.  Numer.  Math.,  35:315-341,  1980. 

[27]  Palmer  J.,  Steele  G.  L.  Jr.  Connection  Machine  Model  CM-5  system  overview. 
In  Proceedings  of  the  Fourth  Symposium  on  the  Frontiers  of  Massively  Parallel 
Computation,  pages  474-483.  IEEE  Computer  Society  Press,  Los  Alamitos,  1992. 

[28]  Raviart  P.  A.,  Thomas  J.  M.  A  mixed  finite  element  method  for  second  order 
elliptic  problems.  In  Mathematical  Aspects  of  the  Finite  Element  Method,  volume 
606  of  Lecture  Notes  in  Mathematics,  pages  292-315.  Springer- Verlag,  Berlin  and 
New  York,  1977.  I.  Galligani  and  E.  Magenes,  eds. 

[29]  Thomas  J.  M.  Sur  I’analyse  numerique  des  methodes  d’ elements  finis  hybrides  et 
mixtes.  PhD  thesis,  Universite  Pierre-et-Marie  Curie,  Paris,  1977. 


347 


NUMERICAL  SOLUTION  OF  RICHARDS’  EQUATION 


A.  J.  SILVA  NETO>  and  R.  E.  WHITE 
Departouint  of  Mathematics 
North  Caroliaa  State  University 
Raleigh,  NC  27695-8205,  USA 


SUMMARY 

The  model  for  fluid  flow  in  partially  saturated  porous  media  involves  a  nonlinear  parabolic 
partial  differential  equation  for  the  hydraulic  head  pressure,  i.e.  Richards’  equation.  We  focus 
on  problems  with  large  derivatives  of  the  moisture  content  and  hydraulic  conductivity  functions. 
Here  similarities  with  the  Stefan  problem  yield  the  successful  application  of  the  nonlinear  SOR 
method  to  Richards’  equation.  Implementation  issues  for  vector  and  mntiprocessor  computers  are 
also  discussed. 


Richards(1931)  developed  a  nonlinear  parabrdic  equation  that 
modds  fluid  flow  in  porous  media.  Richards’  equation  relies 
on  empirical  nonlinear  functions  such  as  the  moisture  content, 
6(h),  and  the  hydraulic  conductivity,  K(h)  : 

Q{h)t-VK{h)Vh-K(k),  =  0  (la) 


where  h  is  the  hydraulic  pressure  head,  and  s  is  the  vertical 
direction. 

The  boundary  and  initial  conditions  usually  have  the  form 
(Jir(h)V  (h  +  r)]  •  n  =  given  for  the  boundary  (lb) 


k  =:  given  for  t  =  0 


where  n  is  a  unit  outward  normal  to  the  space  domain. 

Richards’  equation  coupled  to  a  parabolic  system  for  the  con¬ 
taminants  becomes  the  basic  model  for  the  analysis  of  transport 
of  contaminants  in  groundwater  (Freese  and  Cherry, 1979;Feng, 
1993). 

Traditional  methods  (Paniconi  et  ai,1991)  of  solution  of  prob¬ 
lem  (1)  such  as  Newton,  Picard  and  Lees  implicit  factored 
schemes,  seem  to  work  well  for  problems  without  large  deriva¬ 
tives  in  6(h)  and  K{k).  But,  some  oscillations  can  devdop  for 
the  large  derivative  cases. 

Here  we  will  approximate  the  empirical  functions  6(h)  and 
K{k)  by  piecewise  linear  functions.  This  approach  is  justified 
by  the  often  significant  uncertainties  related  to  the  computsr 
tion  of  6(h)  and  Jir(h).  The  computation  times  are  reduced  by 
avoiding  the  calculation  of  exponential  terms  (van  Genuchten 
and  Nielsen,1985),  and  one  can  still  use  the  monotonidty  of 
6(h)  and  K{k)  to  analyse  convergence  and  obtain  bounds  for 
the  solutions  (i.e.  comparison  results). 

In  this  work  we  show  the  similarities  between  the  enthalpy 
formulation  of  the  Stefan  problem  and  the  analogous  moisture 

‘Pemaaent  address:  Ptomoa  Engeahaiia,  Av.  Pres.  Joseeliao  Ku- 
tatsehek  1830,  Sio  Paolo,  SP,  04S43.000,  Brasil 


formulation  of  the  Richards’  equation.  Then  we  focus  on  the 
solution  of  problem  (1)  using  a  nonlinear  SOR  set-valued  al¬ 
gorithm  (Cryer,1971).  We  will  also  discuss  high  performance 
computing  issues. 

MOISTURE  FORMULATION  OF  RICHARDS’  EQUATION 

The  empirical  functions  of  6(h)  and  K(k)  are  given  by  van 
Genuchten  and  Nielsen(1985)  in  the  form 

f  6,  +  (6,  -  6,K1+  I  ak  I-)—  ,  h  <  ho  (2a) 
6(h)  =  {  6,  -I-  (6.  -  6,K1+  I  aho  r)— 

I  +*»(  ■  ^)  ,  h  >  ho  (2b) 

K(k)  =  (  (l  -  (1  -  .  *  <  *0  (3*) 

[K.  .  h  >  ho  (3b) 


s(k)  =  /  =  (1+  I  (4») 

I  10  ,  h  >  ho  (4b) 


where  the  parameters  6,,  6.,  n,  m  =  1  -  a,  fiT,  and  t,  are 
function  of  the  particular  porous  media,  and  ho  is  chosen  so  that 
^  is  continuous  at  ho.  Often  6(h)  have  large  derivatives, 
and  ^  appears  to  be  discontinuous  in  numerical  calcnlatioiu. 
Also  luge  uncertainties  are  associated  with  functions  in  (2)  to 
(4)  because  of  the  unknown  nature  of  the  subsurface  coil. 

When  the  hydraulic  conductivity,  A,  is  a  function  of  h  only, 
a  Kirchhoff  transformation 


=  F(h)=  rK{k)dk 

Jo 


is  used  and  Richuds’  equation  becomes 


6(F-*{v)),  -  V*v  -  A(F-‘(o)).  =  0 


When  6  is  viewed  u  the  primary  unknown,  Eq.(6)  is  called 
the  moisture  formulation  of  Richards’  equation. 


Thil  if  anaiogotti  to  the  entlulpy  tonnulotioii  of  the  Stefhn 
problem  when  Q  ii  replaced  by  the  enthalpy  and  k  by  the  tem- 
peratore.  In  the  Stefan  problem  the  K,  term  would  be  equiva¬ 
lent  to  a  heat  source  at  the  phase  change  interface. 

For  the  remainder  of  this  paper  we  assume  two  space  dimen¬ 
sions,  with  y  replacing  s  on  the  vertical  direction. 

THE  FINITE  DIFFERENCE  METHOD 

As  in  the  Stefan  problem  for  long  time  durations,  the  implicit 
time  discretisation  is  used  in  Eq.(6), 


0m+l  _  ^ 

iSt 


_  -  j(r(v"*+*)„  =  0 


(7) 


The  FDM  for  the  interior  grid  points  is 


©{F->K))  ,  K(F-\vi^)) 
^ 


(8) 


where 


2  2 
A**  ^  Ay* 


(9) 

(10) 

(11) 


and  Vij  =  Vij  from  the  previous  time  step. 

Similar  equations  are  written  for  the  cells  at  the  boundsries. 
If  there  are  jump  discontinuities  in  either  Q(h)  or  K[h), 
£q.(8)  is  a  set-valued  equation  in  the  form 


d  —  cv  €  r(u) 


(12) 


Here  we  are  trying  to  find  v  so  that  d  —  c  v  is  an  element  of  the 
set  r(v).  As  depicted  in  Fig.l,  there  are  three  cases  to  consider. 
For  each  case  there  is  a  solution,  and  it  is  unique.  As  in  the 
enthalpy  formulation  of  the  Stefan  problem,  this  will  accurately 
track  the  moisture  in  regions  where  large  derivatives  or  jump 
discontinuities  are  located. 

NONLINEAR  SQR  FOR  SET- VALUED  SYSTEMS 

The  following  sdgorithm  is  applied  to  Eq.(8)  or  to  the  set¬ 
valued  system  (12),  where  maxit  =  maximum  allowable  SOR 
iterations,  =  number  of  cells  in  the  x  and  y  directions 

respectively,  1.0  <  (7  <  2.0  is  the  SOR  parameter  which  is  to 
be  applied  to  either  the  unsaturated  or  the  saturated  cells,  and 
0.8  <  u;  <  0.9  is  used  to  dampen  numerical  oscillations. 


Figure  1.  Set-valued  Equation. 


Nonlinear  SOR  Algorithm. 

for  k  =  1,  maxit 
for  i  =  1,  n. 
for  j  =  1,  n, 

compute  dij  from  Eqs.(8)  or  (12) 
solve  dif  —c  Vij  6  r(vy)  as  in  Fig.l 
if  u  Vij  >  0,  then 

Vij  =  (1  -  <?)««  +  2;  w 

else 

Vij  =  (1  -  Sd)w«  +  SsJ  w 
end  if 
end  loop  j 
end  loop  i 

check  for  convergence 
end  SOR  loop  k 

In  our  calculations  the  stopping  criterion  consisted  of  requir¬ 
ing  the  difference  for  calculated  v  between  successive  iterations 
to  be  less  than  e  fc*,  where  fi  =  max(Ax,  Ay)  and  e  ranged  from 
10"*  to  10”*.  Convergence  was  usually  obtuned  in  10  to  30 
iterations. 

HTOH  PERFORMANCE  COMPUTING 

We  tried  vectorization  of  the  CRAY  Y-MP  computer,  vector- 
ization  and  multiprocessing  on  a  two  processor  ALLIANT  FX- 
40,  and  multiprocessing  on  the  Kendall  Square  Research  KSRl 
computer. 

In  general  the  vectorization  did  not  work  well.  Here  we  used 
Red-Black  ordering  (White,1987),  but  the  computations  in  the 
inner  most  loops  were  too  complicated  to  vectorize  effectively. 
This  was  observed  on  both  the  CRAY  Y-MP  and  the  ALLIANT 
FX-40. 

In  order  to  use  multiprocessor,  a  domain-decomposition  tech¬ 
nique  (White, 1987)  is  applied,  as  illustrated  in  Fig.2.  We  or¬ 
dered  the  even  blocks  first  and  then  the  odd  blocks.  The  non- 
Unear  SOR  algurithm  is  then  executed  in  parallel  as  follows. 

2^r 


for  ik  s  1,  momU 

concnnently  do  nonlinemr  SOR  over  the  even  blocks 
vpdnte 

concurrently  do  nonlinenr  SOR  over  the  odd  blocks 
updnte 

check  for  convergence 
end  SOR  loop  k 

The  KSRl  computer  hu  three  pnmllcl  constructs  thnt  cun 
be  used  in  FORTRAN  code:  UUa,  paraUel  sections  and  paroikl 
rtgiont.  Tiles  are  used  to  partition  loops  and  are  very  effective 
for  simple  cases  such  as  matrix  multiplications.  Parallel  sections 
can  be  used  to  concurrently  execute  different  code  segments.  We 
used  parallel  regions,  in  which  code  segments  are  duplicated  to 
allow  concurrent  computations.  In  our  implementation  we  used 
teams  of  processors  that  are  assigned  in  the  initial  part  of  the 
program  due  to  the  high  overhead. 

Table  1  shows  a  comparison  of  the  CPU  time  on  the  KSRl 
computer  of  the  North  Carolina  Supercomputing  Center  for  the 
computation  of  the  first  time  step  of  Example  3  presented  in  the 
next  section.  Here  iV  =  n,  =  n,  is  the  number  of  grid  cells  in 
each  direction,  L  is  the  number  of  large  blocks  (L=i  in  Fig.2), 
Si  =  El  =  ^  »  ‘he 

efficiency. 

Table  1.  Speedup  for  Domain-Decomposition  SOR  on  KSRl 


JV 

L 

time  (s) 

Sl 

El 

80 

1 

8.74 

1.00 

1.00 

80 

2 

5.57 

1.57 

0.79 

80 

4 

3.02 

2.89 

0.72 

£0 

8 

2.22 

3.94 

0.49 

160 

1 

49.49 

1.00 

I.OO 

160 

2 

27.75 

1.78 

0.89 

160 

4 

14.41 

3.43 

0.86 

160 

8 

8.73 

5.67 

0.71 

We  observed  a  declining  efficiency,  Ei,  with  an  increasing 
number  of  large  blocks,  L.  This  is  consistent  with  Amdahl’s  law 
(Ortega, 1988).  A  second  important  observation  is  the  increase 
in  efficiency  with  the  increase  on  the  number  of  grid  cells,  N. 
This  fact  is  explained  by  the  relatively  smaller  ratio  of  amount 
of  parallel  overhead  to  the  amount  of  concurrent  computations. 

These  results  were  also  observed  for  other  computations  and 
indicates  that  the  KSRl  computer  can  be  used  efficiently  for 
larger  problems. 


To  validate  the  numerical  model  we  present  the  tcsuits  fm 
three  numerical  experiments.  For  the  first  two,  analytical  so¬ 
lutions  are  available,  and  for  the  third  one  a  compariaon  with 
experimental  data  is  d<me. 

For  all  computations  presented  here  we  used  piecewise  lin¬ 
ear  i^proximations  of  the  empirical  functions  (2)-(4)  for  the 
moisture  content,  d(h),  and  hydranlic  cmidnctivity,  K{k),  as 
shown  in  Fig.3.  These  approximations  allow  the  test  of  the 
method  under  most  strict  conditions  due  to  the  discontinuity 
of  the  derivatives  with  respect  to  the  pressure  head.  In  fact  it 
will  also  represent  savings  on  CPU  time  by  avoiding  the  quite 
involved  calculations  required  in  Eqs.(2)  to  (4). 

The  first  and  second  examples  are  in  one  space  dimension 
and  have  the  boundary  and  initial  conditions  in  the  form 

My.0  =  e‘-1  »‘»  =  0,  (13a) 

M».0  =  -  l)  »‘y=l.  (13b) 

h(y,t)  =  a  (e"»  -  l)  for  t  s  0  .  (13c) 

The  analytical  solu tiers  for  particular  choices  of  a  and  the 
parameters  in  Fig.3,  ha  .he  form 

e‘-*-l  ,  t>y  (14a) 

a(e‘-*-l)  ,  t<y  (14b) 

Example  1.  This  example  correspciMia  to  a  sttnatienm  which 
there  is  no  water  supply  bom  the  top  of  the  domain  and  them  is 
an  infiltration  from  the  bottom,  as  from  a  groundwater  sonree. 

The  parameters  used  in  the  piecewise  linear  approximaition 
of  6(h)  and  K(h),  as  shown  in  Fig.3,  were  Oi  =  1,  =  0, 
oj  =  2,  da  -  1,  Cl  =:  0,  h]  =  1,  ca  =  0  and  kj  s  2.  It  it 
considered  also  a  =:  0  in  Eqs.(13). 

In  the  numerical  experiments  we  used  20  and  40  grid  cells, 
and  At  =  Ay.  Convergence  was  uniformly  rapid  and  the  inter¬ 
face  was  always  located  to  within  one  grid  cell  (i.e.  in  all  runs  we 
were  able  to  keep  track  of  the  saturated/unsaturated  interface 
very  accurately). 

Some  oscillations  were  noted  near  the  interface,  which  is  sim¬ 
ilar  to  what  happens  with  the  enthalpy  method  for  the  Stefan 
problem.  In  fact  the  problem  at  hand  erm  be  viewed  as  a  one 
phase  Stefan  problem  with  a  source  at  the  interface. 


2.  Thif  example  conetponils  to  antaatioD  ia  wUcli 
there  U  a  water  supply  from  both  the  top  and  the  bottom  of  the 
domain. 

The  parameters  used  in  the  piecewise  Unear  approximation 
of  0(h)  and  K{k)  were  O)  =  |,  =  0,  o,  =  1,  dj  =  1,  cj  =  0, 

hi  =  |)  ct  =  0  and  hj  =  2.  It  is  considered  also  o  =  ^  in 
Eqs.(13). 

Similar  performance  to  Example  1  was  observed. 

Example  3-  The  repacked  Brindabella  silty  loam  has  data 
corresponding  to  Eqs.(2)  to  (4)  as  given  by  Feng(l993):  Kt  = 
0.118  m/h,  e,  =  0.110,  0.  =  0.485,  a  =  2.857,  n  =  1.8,  s.  =  0, 
and  ho  =  0. 

The  flux  at  the  top  boundary  was  taken  as  0.0165  m/h,  while 
the  other  boundary  segments  had  sero  flux.  The  initial  condition 
was  -8.0  m,  and  the  region  was  0.3  m  x  0.3  m.  We  attempted 
to  reproduce  the  data  given  by  White  and  Broadbridge(I988); 
therefore,  we  focused  on  the  portion  of  0(h)  and  K{h)  such 
that  0  <  0  425.  Here  we  used  a  coarse  approximation  of  the 
empirical  functions  0(A)  and  A'(h)  adopting  Oj  =  0.021,  di  = 
0.275,  oj  =  0.0,  e,  =  0.425,  hj  =  0.0165.  c;  =  0.0,  Cj  =  0.0,  and 
fci  =  0.0000!.  The  results  are  given  in  Fig. 4. 

The  nonlinear  SOR  algorithm  converged  with  no  difficulties 
until  saturation  was  reached.  The  differences  in  the  data  and 
the  computed  values  are  attributed  to  the  crude  approximation 
of  the  hydrauUc  conductivity  function. 

For  more  complicated  hydrauUc  conductivity  functions,  one 
should  not  use  the  Kirchhoff  transformation,  Eq.(5).  There  are 
two  reasons  for  this;  First,  the  solution  of  the  SOR  step  in  Eq.(8) 
or  (12)  becomes  more  complicated.  Second,  in  most  porous  me¬ 
dia  the  hy^.  aulic  conductivity  is  also  space  dependent,  in  which 
case  the  Kirc'ithoff  transformation  does  not  simplify  the  diffusion 
part  of  Richards'  equation. 

CONCLUSIONS 

The  nonlinear  SOR  algorithm  was  coupled  with  the  mois¬ 
ture  formulation  to  approximate  the  solution  of  R -hards’  equa¬ 
tion.  Here  special  attention  was  given  to  problerr  with  large 
derivatives  of  the  moisture  content  and  hydraulic  conductivity 
functions.  The  nonlinear  SOR  algorithm  converged  rapidly  for 
partially  saturated  regions.  Moreover,  it  did  adapt  very  well  to 
multiprocessing,  but  not  so  well  to  veci' irization 

Although  the  calculations  in  this  paper  were  for  one  and 
two  space  variables  and  for  homogeneous  porous  media,  there 
should  be  little  problem  in  generalizing  the  method  to  three 
space  variables  and  to  inhomogeneous  media  with  an  assortment 
of  nonlinear  terms. 
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ABSTRACT 

This  paper  describes  an  SOR  algorithm  for  solving  the  nonlinear  algdiraic  system  which 
evolves  from  Richards'  equation  that  models  fhiid  flow  in  a  porous  media.  The  moisture 
content  and  hydraulic  conductivity  functions  are  approximated  by  piecewise  linear 
functions  obtained  from  field  data.  The  resuhing  algdiraic  system  is  solved  by  a  variation 
of  the  nonlinear  SOR  algorithm.  The  advantage  of  this  approach  is  that  it  avoids  some  of 
the  numerical  oscillations  associated  with  large  derivatives  in  the  data.  Numerical 
calculations  are  presented  and  illustrate  the  following;  (i)  agreemem  of  the  numerical 
model  with  observed  data,  (ii)  dependence  and  comparison  results  as  a  function  of 
uncertain  data,  and  (iii)  suitability  of  these  algorithms  for  multiprocessing  computations 
via  domain  decomposition  methods  Extension  of  these  algorithms  to  heterogeneous 
porous  media  fluid  flow  are  discussed. 


'The  calculations  were  done  at  the  North  Carolina  Supercomputing  Center 
^Supported  b>  CNPq  and  Promon  Engenharia  from  Brazil. 
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1.  INTRODUCTION 

The  study  of  fluid  flow  in  porous  media  has  several  important  applications  in 
engineering  (Kaviany  1991  and  Nield  and  Bejan  1992)  Specific  examples  are:  filters  for 
industrial  use,  or  separators  in  aerospace  fuels  (Kaviany  1991);  use  of  geothermal  energ>’ 
(Rae  et  al.  1983  and  Kinuira  1989,  1989a);  oil  recovery  (Bear  1972);  groundwater 
(Mark  1992,  1993  and  Clothier  et  al.  1981)  aixi  agriculture  (Feng  1993) 

Industrial  chemical  or  radioactive  effluents  are  sometimes  deposited  at  the  surface 
or  in  drums  that  are  buried  underground.  In  both  normal  operations  and  in  accidental 
conditions,  it  is  required  to  give  an  analysis  of  the  transport  of  the  contaminants  through 
the  soil  (Muralidhar  1990,  1993).  The  first  step  in  this  analysis  is  the  mathematical 
simulation  of  fluid  flow  through  the  soil 

Richards  (1931)  developed  an  equation  that  is  a  combination  of  the  continuity 
equation  and  Darcy's  law  (Philip  1969),  and  it  models  fluid  flow  in  a  porous  medium.  It  is 
a  nonlinear  parabolic  partial  differential  equation  which  contains  the  empirical  functions 
tor  moisture  content  0(h)  and  hydraulic  conductivity  K(h). 

e{h),  -  V  K{h)Vh  -  K{h)^  =0  in  Qx(0,T)  (la) 

where  h  is  the  hydraulic  pressure  head,  z  is  the  vertical  direction  and  is  the  space 
domain  The  boundar>  condition  of  the  third  kind  has  the  form 

\K{h)'^  {h  ^  z)\- n  =  given  in  Sx(0,T)  (lb) 

where  n  is  the  unit  outward  normal  to  Q  and  S  is  the  boundary  of  Q  The  initial  condition 
is 

h  =  given  fort  =  0  in  Q.  (Ic) 
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In  general  the  equations  (la-lc)  are  coupled  with  a  parabolic  system  of  equations  for  the 
transport  of  a  number  of  comaminants  through  the  soil  (Freeze  and  Cherry  1979  and  Feng 
1993).  This  is  done  by  using  the  fluid  velocity  v  =  K(h)V(h  +  z)  which  is  computed  from 
the  above  system. 

In  practice  the  empirical  functions  for  moisture  content  and  hydraulic  conductivity 
have  several  troublesome  properties  First,  they  can  have  large  derivatives,  and  this  is 
often  the  case  for  hydraulic  conductivity.  Second,  they  are  not  precisely  known.  Third, 
they  can  have  strong  space  dependence  with  jump  discontinuities  resuhing  from 
heterogeneous  porous  media.  The  objective  of  this  paper  is  to  give  an  approach  to  these 
problems  which  is  based  on  methods  used  for  the  Stefan  problem  (Silva  Neto  aiul  White 
1993).  Particular  attention  will  be  given  to  the  first  problem  where  there  is  no  space 
dependence.  In  the  case  of  space  dependent  empirical  functions,  one  can  use  additional 
nodes  and  the  continuity  condition  on  the  fluid  velocity  (White  et  al.  1993)  to  generalize 
the  methods  of  this  paper. 

Traditional  methods  for  the  solution  of  (la-lc)  use  an  approximation  of  the 
empirical  data  by  expone*^al  functions  (van  Genuchten  and  Nielsen  1985).  Then 
numerical  methods  such  as  Newton,  Picard,  or  Lees  implicit  factored  method  can  be  used 
for  problems  without  large  derivatives  (Paniconi  et  al.  1991).  In  addition  to  addressing 
the  above  problems,  the  approach  of  this  paper  does  not  involve  expensive  function 
evaluations  and  does  eliminate  the  numerical  oscillations  associated  with  large  derivatives 
of  the  empirical  functions. 

In  section  two  we  present  the  general  approach  to  the  problem  which  is  adapted 
from  the  Stefan  problem  Here  the  empirical  functions  are  approximated  by  piecewise 
linear  functions  which  reflect  the  field  data  (White  and  Broadbridge  1988).  The  partial 
differential  equation  is  discretized  by  the  finite  difference  method,  and  the  resulting 
nonlinear  system  is  solved  by  a  nonlinear  SOR  algorithm  (Cryer  1971)  whic  scribed 
in  section  three. 
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Numerical  experiments  are  presented  in  sections  four,  five  and  six,  and  these 
experiments  were  chosen  to  demonstrate  the  feasibility  of  realistic  two  dimensional 
simulation  of  porous  flows.  Later,  we  indicate  how  one  can  extend  these  methods  to  three 
dimensional  and  heterogeneous  porous  flows.  We  show  agreement  of  the  numerical 
model  with  the  field  data  from  Brindabella  silty  loam  soil  Also,  we  show  how  one  can 
develop  comparison  results  which  deal  with  the  uncertain  empirical  data.  High 
performance  computing  issues  are  described.  Here  we  demonstrate  that  the  algorithm  in 
section  three  does  not  vectorize  well,  but  it  does  work  well  for  multiprocessors  when 
domain  decomposition  methods  are  used.  Finally,  we  state  our  conclusions  and  rdated 
work. 

2,  DISCRETE  VERSION  OF  RICHARDS'  EQUATION 

In  this  section  we  state  the  finite  dififeroice  discretization  of  Richards'  equation  and 
make  some  comparisons  with  the  Stefan  problem.  If  in  equation  (la)  the  last  term  is 
eliminated,  and  h  were  to  represent  temperature  with  K  now  denoting  the  thermal 
conductivity  and  0  the  enthalpy,  then  this  would  be  the  enthalpy  formulation  of  the  Stefan 
problem  (White  1985)  In  the  Stefun  problem  the  K  and  0  have  jump  discontinuities  at  the 
phase  change  temperature.  SOR  methods  c£  effectively  used  provided  the 
overrelaxation  is  not  applied  during  a  cell's  phase  change 

In  Richards'  e^iuation  we  will  approximate  K  and  0  by  piecewise  linear  functions 
which  could  be  viewed  as  a  number  of  "linear  phases"  associated  with  the  nonlinear  flow 
.As  in  the  Stefan  problem  we  will  apply  the  SOR  method  provided  the  cell  is  not  changing 
"linear  phase"  Table  1  gives  some  data  for  Brindabella  loam  which  was  extrapolated 
from  the  graphs  in  White  and  Broadbridge  (1988)  Note,  both  0(h)  and  K(h)  are 
monotone,  and  K(h)  has  large  derivatives. 
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Table  1: 


BrindabeHa  Data 


agm 

lllliSHSI 

■BSIH 

-8.0 

0.11 

0.0 

-1.2 

0.21 

0.000  118 

-0.8 

0.30 

0.000  327 

-0  7 

0  31 

0.000457 

-0.6 

0.32 

0.000  664 

-0.5 

0.33 

0.001  138 

-0.4 

0.34 

0.001  693 

-0  3 

036 

0.002  326 

-0.2 

0.38 

0  004  568 

-0.1 

0.42 

0.011  117 

-0.0 

0.485 

0.118  000 

In  SUva  Neto  and  White  (1993)  two  "linear  phases"  were  used  and  were  coupled 
with  the  KirchhojQT  change  of  dependent  variable.  Although  this  was  a  cnide 
approximation  of  the  data,  it  did  track  the  wet-diy  inteiftce  'M^iere  a  rapid  transition  from 
unsaturated  to  saturated  regions  occurs.  In  cases  where  either  the  transition  is  not  r^id 
or  there  is  space  dependence  of  the  data,  the  Kirchhoff  transformation  is  not  applicable. 
In  the  following  we  use  an  implicit  time  discretization  of  Richards'  equation. 

-  (K{h)h, ),  -  {Kih)h^  \  -  Kih),  =  0  (2) 

where  h  is  known  from  the  previous  time  step.  Here  we  are  in  two  space  dimensions,  and 
y  is  the  vertical  direction 

Next  we  discretize  the  space  variable  by  the  finite  difference  method.  In  the 
calculations  that  we  later  discuss,  we  consider  a  two  dimension  flow  with  zero  flow 
through  the  sides  and  bottom,  and  nonzero  flow  through  the  top.  The  finite  difference 
grid  is  illustrated  in  Figure  1  where  the  nine  types  of  boundary  cells  are  indicated  Here 
there  are  N  =  3  cells  in  each  direction  and  =  9  unknowns. 
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(K(h)l^  +  K(h))=R 


^K(h)hy+  K(h))=0 

Figure  1 :  Finite  Difference  Grid 


Let  (ij)  denote  the  location  in  the  finite  difference  grid.  Then  the  general  form  of 
the  finite  difference  equation  at  this  location  is 

- ^iAi  =  ^ (3) 
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In  the  above  equation  we  used  the  convention  that  uz  average  of  the 

hydrautic  conductivity  at  the  appropriate  surrounding  nodes.  We  also  will  assume  that  the 
surrounding  nodes  are  evaluated  at  a  "previous  iteration"  value.  Thus  equation  (3)  is  a 
piecewise  linear  system  as  illustrated  in  Figure  2.  Both  the  piecewise  linear  approximation 
of  the  moisture  content  and  hydraulic  conductivity  functions  are  monotone,  and  so.  T 
must  also  be  monotone  nondecreasing.  Since  the  term  on  the  left  side  of  equation  (3)  has 
negative  slope,  equation  (3)  has  a  solution,  and  it  is  unique.  In  Figure  2  the  data  is 
depicted  as  being  continuous,  but  this  need  not  be  the  case.  Even  if  Ffh)  has  a  jump  and 
remains  nondecreasing,  one  can  still  solve  for  a  unique  h  (Silva  Neto  and  White  1993). 


3.  NONLINEAR  SOR  ALGORITHM 

The  following  algorithm  has  evolved  from  the  work  by  Cryer  ( 1971)  for  set  valued 
systems  of  equations  that  may  come  from  models  of  the  Stefan  problem.  We  apply  a 
variation  to  the  system  given  in  (3).  The  following  variables  are  used; 

maxit  =  number  of  allowable  SOR  iterations  per  time  step, 
n^,n^=  number  of  cells  in  the  x  and  y  direction, 

(0,0)  =  overrelaxation  (larger  than  1  0)  and  underrelaxation  (less  than  1.0) 
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for  k  =  Kmaxit 

fori=  K/i, 

for  j  = 

compute  Ci,  and  d,  ^  as  given  in  (3) 

solve  for  h  in  (3)  as  given  in  Figure  2 

if  h  and  h,  ^  are  in  the  same  linear  phase,  then 

h.^  =(I-^)h.j  +a)h 
else 

K,  =  (1 "  ^  h 

endif 
end  loop  j 
end  loop  i 

test  for  convergence 
end  loop  k. 

In  the  computation  of  the  c  and  d  values  one  must  consider  the  nine  types  of  cells 
as  indicated  in  Figure  1  For  more  complicated  geometric  configurations  and  for 
heterogeneous  porous  media,  this  will  be  more  complicated.  If  adjacent  cells  have 
different  moisture  content  and  hydraulic  conductivity  data,  then  one  must  insert  additional 
nodes  between  the  cells  and  demand  continuity  of  the  flow  velocity  at  the  interface  (White 
etal  1993). 

In  the  solve  step  one  must  determine  which  linear  phase  the  solution  is  in,  and  this 
is  done  by  partitioning  the  vertical  axis  as  indicated  in  Figure  2  by  the  dotted  lines  that  are 
parallel  to  the  line  given  by  d  -  ch.  Hence,  the  solve  step  has  a  loop  in  it  which  was  not 
indicated  above.  This  hidden  loop  contains  the  nonlinear  nature  of  the  solve  step. 


Moreover,  if  there  is  a  large  number  of  linear  phases,  then  the  solve  step  will  become  more 
expensive  to  compute. 

The  overrelaxation  is  used  to  reduce  the  number  of  outer  iterations,  and  the 
optimal  choice  will  vary  with  the  number  of  unknowns  The  imderrelaxation  is  used  to 
avoid  numerical  oscillations,  and  this  works  well  for  choices  between  0.8  and  0.9  The 
numerical  oscillations  are  a  result  of  passing  from  one  linear  phase  to  the  next  linear  phase 
This  deals  with  the  large  derivatives  of  the  data  by  breaking  the  changes  in  slopes  into  a 
number  of  smaller  changes  in  slope.  We  found  this  to  be  much  more  effective  than  the 
traditional  method  of  reducing  the  time  step. 

We  experimented  with  a  number  of  convergence  tests.  Finally,  we  imposed  two 
conditions; 

(i)  max|  new  h  -  old  h  |  <  £,  and 

(ii)  JJ  1  new  0  -  old  0\^  £j- 

The  first  condition  is  aimed  at  possible  convergence  of  the  pressure  at  each  node.  The 
second  condition  reflects  possible  convergence  of  the  total  moisture,  and  it  is  more  of  a 
global  test  than  the  first  condition. 

4.  COMPUTATIONS  FOR  BRINDABELLA  LOAM 

The  purpose  of  these  computations  is  to  see  if  our  model  of  Richards'  equation  will 
accurately  track  the  movement  of  moisture  through  Brindabella  loam.  We  compare  our 
calculations  with  the  observations  in  White  and  Broadbridge  (1988)  In  our  numerical 
model  we  considered  a  0.3[m]  x  0.3[m]  region  with  boundary  conditions  as  indicated  in 
Figure  1  In  the  top  boundary  we  used  R  =  0.0I65[m/hr].  and  the  initial  pressure  was  set 
as  h  =  -8.0[m].  The  moisture  content  function  was  a  linear  interpolation  of  the  data  in 
Table  1  The  hydraulic  conductivity  data  indicates  a  very  increasing  and  concave  up 
function;  consequently,  linear  interpolation  of  the  data  would  generate  large  errors  In  the 
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calculations  presented  in  Figure  3  a  we  used  a  linear  interpolation  of  the  modified  data  for 
hydraulic  conductivity  in  Table  1 ;  we  reduced  the  interior  values  by  50  percent  and  kept 
the  two  end  values  at  0.0  and  0. 1 18. 

In  our  computations  we  set  the  following  parameters  at 
At  =  0.125[hr]  Ax  =  0.015[m] 

N  =  20[cells  in  each  dir.]  R  =  0.0165[m/hr] 

oa  =0.9  TO  =  1.4 

ei  =  10-4  62  =  10-8 

Convergence  was  usually  attained  in  20-40  iterations.  If  no  underrelaxation  was  used, 
then  numerical  oscillations  would  occur  about  once  every  20  time  steps. 

During  the  initial  times,  the  hydraulic  conductivity  is  small,  and  Richards'  equation 
is  dominated  by  wave  like  properties.  As  time  progresses  the  hydraulic  conductivity 
increases  so  that  Richards'  equation  is  dominated  by  difiiision.  At  time  6.0[hr]  the  steady 
state  solution  has  essentially  been  reached  At  the  bottom  (y  =  300[mm])  the  porous 
media  is  at  saturation  (0  =0.485)  At  the  top  (y=0[mm])  the  porous  media  has  pressure 
such  that  K(h)  =  R  (0(h)  =0.426).  At  this  time  the  difiiision  force  is  equal  to  the 
gravitational  force;  hence,  no  more  moisture  can  enter  the  porous  media. 
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Figure  3a:  Computed  Moisture  for  Variable  Times 

The  observed  moistures  are  indicated  by  discrete  points  in  Figure  3b.  These  were 
for  the  times  of  1.0  (diamonds),  2.2S  (triangles)  and  5.0  hours  (squares).  The  computed 
values  give  good  agreemem  with  the  observed  values.  The  computed  moisture  comem 
curves  are  somewhat  more  smoothed  than  the  observed  moisture  comem  data.  This  may 
be  attributed  to  large  hydraulic  conductivity  data;  if  one  reduces  the  hydraulic 
conductivity  for  smaller  pressures,  then  a  sharp  from  can  be  calculated  to  match  the 
observed  moisture  comem  data. 
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Figure  3b:  Observed  Moisture  Data  for  Variable  Times 


5.  COMPUTATIONS  WITH  UNCERTAIN  DATA 

This  section  contains  an  analysis  of  the  moisture  as  a  fimction  of  the  empirical 
data  of  the  moisture  content  and  hydraulic  conductivity  functions.  In  practice  much  of  this 
data  is  not  precisely  known,  and  therefore,  the  effects  upon  the  computations  from  any 
model  will  have  some  uncertain  aspects.  In  our  numerical  experiments  we  decreased  K 
and  the  computed  moisture  at  the  lop  increased.  We  also  increased  0  and  the  computed 
moisture  at  the  top  increased  In  the  computations  indicated  in  Figure  4,  for  time  equal  to 
1  25[hr],  we  decreased  K  and  increased  0,  and  the  largest  computed  moisture  content  at 
the  top  was  the  result  Here  we  kept  the  data  at  the  end  points  of  Table  1  fixed  and  varied 
the  interior  data  by  increments  of  20  percent  In  all  computations  the  computed  moisture 
content  at  the  top  increased  while  the  computed  moisture  content  at  the  bottom 
decreased  This  happens  because  the  sides  and  bottom  do  not  permit  flow  through  them, 
and  the  total  moisture  must  remain  constant. 
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Figure  4:  Moisture  and  Variable  Data 
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In  order  to  gain  some  insight  on  this,  it  is  instructive  to  examine  the  finite 
difference  equation  at  the  top  region.  In  the  case  of  the  top  center  nodes,  d  has  the  fi)rm 

d  =  —  -  — -----  +  d  where  d  has  form  amilar  to  that  given  in  (3). 

Ay  Ay 

Figure  5  shows  that  if  F  increases,  then  the  solution  of  (3)  wiU  decrease.  Also,  if  d 
decreases,  then  the  solution  of  (3)  will  decrease.  Therefore,  if  both  F  increases  and  d 
decreases,  then  the  solution  of  (3)  will  decrease.  If  K  decreases,  then  d  will  increase  and 
F  will  decrease.  However,  if  both  K  decreases,  and  0  increases  enough,  then  F  will 
increase.  For  our  choices  of  At  and  Ay  this  is  the  case.  Of  course,  this  is  just  an  analysis 
at  one  grid  point,  and  the  argument  requires  much  more  careful  discussion. 
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6.  COMPUTATIONS  USING  MULTIPROCESSORS 

In  the  computations  rqx>fted  in  this  section  we  tried  to  implement  the  above 
algorithm  on  a  sii^e  CPU  with  vectorization  on  a  Cray  Y-MP,  the  Alliant  FX-40  with 
two  vectorized  CPUs,  and  the  Kendall  Square  Research  KSRl  with  up  to  16  CPUs  and  no 
vectorization.  In  the  calculations  in  Silva  Neto  and  White  (1993)  the  vectorization 
methods  did  not  seem  to  work  well.  These  attempts  involved  reordering  the  nodes  by  the 
red-black  order  (checker  board  order).  This  method  also  did  not  work  well  for  our 
current  problem  The  reason  for  this  is  that  the  inner  most  loop  has  computations  which 
are  too  complicated  to  effectively  be  done  on  a  vector  pipeline. 

The  multiprocessing  approach  with  domain  decomposition  reordering  (White 
1987,  or  Ortega  1988)  was  much  more  promising  This  reordering  is  depicted  in  Figure  5 
where  L  =  4  (the  number  of  larger  blocks  of  nodes)  and  the  classical  order  of  the  blocks  of 
grid  points  is 

P  P  P  P  P  P  P 

The  domain  decomposition  order  lists  all  the  smaller  interior  boundary  blocks  first  (even 
number  blocks  in  Figure  6)  and  is 
P„P,,P„P,,P„P„P, 
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The  idea  behind  this  reordering  is  to  take  advantage  of  the  5>point  finite  difference  pattern. 
Once  the  calculations  in  the  even  blocks  have  been  done,  then  the  calculations  in  the  large 
odd  blocks  are  independent  of  one  another. 


Nonlinear  SOR  Algorithm:  Domain  Decomposition 
fork=  l,maxit 

concurrently  do  SOR  over  the  even  blocks 
update 

concurrently  do  SOR  over  the  odd  blocks 
test  for  convergence 
end  loop  k 

In  our  calculations  we  used  the  Kendall  Square  Research  multiprocessing 
computer,  KSRl,  which  is  operated  by  the  North  Carolina  Supercomputing  Center  The 
KSRl  multiprocessing  computer  has  three  parallel  constructs  that  can  be  used  in 
FORTRAN  code  ,  tile,  parallel  section  and  parallel  region.  Tile  is  used  to  partition  loops 
and  is  very  effective  for  simple  computations  such  as  matrix  multiplications  Parallel 
section  can  be  used  to  concurrently  execute  different  code  segments.  We  used  parallel 
region  which  duplicates  a  code  segment  and  uses  different  data  streams.  In  our 
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computations  we  controlled  the  number  of  processors  by  using  a  team  of  processors  that 
are  assigned  at  the  loginning  of  the  code  and  are  used  to  reduce  parallel  overhead 

Table  2  shows  the  speedup  and  efficiency  for  a  variety  of  L  (the  number  of  large 
blocks)  and  N  (the  number  of  cells  in  each  direction)  These  quantities  are  defined  as 
follows 

Sl  =  (CPU  time  using  one  block )/(CPU  time  using  L  large  blocks)  and 
El  =  Sl/l 

In  the  first  four  rows  N  varies  and  L  is  fixed  We  see  increased  speedup  and  efficiency  as 
N  increases.  This  is  a  result  of  decreased  parallel  overhead.  In  the  last  four  rows  N  is 
fixed,  and  L  is  increased.  Here  the  speedup  increases,  but  the  efficiency  decreases.  Of 
course,  if  there  are  many  larger  blocks  (L)  and  the  number  of  cdls  in  each  direction  (N) 
remains  the  same,  then  the  relative  size  of  the  larger  block  to  the  smaller  block  decreases. 
This  partially  accounts  for  the  decreased  efficiency.  These  calculations  did  not  attempt  to 
make  the  most  efficiem  use  of  the  FORTRAN  language,  or  the  most  efficient  use  of  the 
KSRl  computer’s  architecture 

Table  2:  Speedup  and  Efficiency 


N 

L 

Si. 

El. 

20 

2 

1.56 

0.78 

40 

2 

1  68 

0  84 

80 

2 

1.75 

0.88 

160 

2 

1.78 

0  89 

160 

4 

3  16 

0  79 

160 

8 

5.09 

0  64 

160 

16 

7.29 

046 
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7.  CONCLUSIONS 

Richards'  equation  was  approximated  by  the  finite  difference  method,  and  the 
empirical  data  for  the  moisture  content  and  the  hydraulic  conductivity  were  approximated 
by  piecewise  linear  functions.  The  resulting  nonlinear  algebraic  system  was  solved  by  a 
variation  of  the  nonlinear  SOR  iterative  method.  Good  convergence  properties  were 
observed  for  three  types  of  calculations  which  were  chosen  to  demonstrate  the  feasibility 
of  realistic  numerical  simulations  using  this  method.  These  included  an  accurate 
simulation  of  fluid  flow  in  Brindabella  loam,  a  sensitivity  analysis  of  the  computed  solution 
upon  the  empirical  data,  and  the  use  of  mult4>rocessing  computers  via  domain 
decomposition  methods. 

In  the  above  calculations  the  empirical  data  did  not  have  a  space  dependence. 
However,  in  White  et  al.  (1993)  we  illustrated  for  a  steady  state  and  one  space  dimension 
version  of  Richards'  equation  that  the  compact  volume  method,  in  place  of  the  finite 
difference  method,  could  be  effectively  used  for  such  heterogeneous  problems.  The 
compact  volume  method  can  be  viewed  as  an  enhanced  finite  difference  method  where 
additional  nodes  are  inserted  at  the  cell  interface  and  additional  equations  are  generated  by 
requiring  continuity  of  the  fluid  velocity  at  these  interfaces.  This  may  be  done  for  all  cell 
interfaces  or  for  just  those  cells  where  the  empirical  functions  change  with  respect  to  the 
space  variable.  We  expect  the  methods  of  this  paper  to  generalize  via  the  compact  volume 
method  to  the  more  complicated  heterogeneous  case 
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Abstract 

It  is  well  known  that  Kolmogorov  equation  is  a  fundamental  equation  in  Applied 
Science,  especially  in  Electrical  Engineering.  Our  original  motivation  is  to  solve  the 
Duncan- Mortensen-Zakai  equation  in  nonlinear  filtering  theory.  If  the  observation  ^(x) 
is  a  constant  in  Dimcan-Mortensen-Zakai  (DMZ)  equation,  then  it  becomes  the  famous 
Kolmogorov  equation.  If  we  treat  /i(x)  as  a  function  agmn,  then  the  resulting  equation  is 
called  generalized  Kolmogorov  equation.  In  this  paper,  we  write  down  the  formal  solution 
of  this  generalized  Kolmogorov  equation  in  a  closed  form.  We  shall  report  the  convergent 
solution  in  the  subsequent  paper. 

§  1.  Introduction 

In  the  sixties  and  early  seventies,  the  basic  approach  to  non-linear  filtering  theory 
was  via  'Hnnovations  methods’*,  originally  pr^fKised  by  Kailath  in  1967  and  subsequently 
rigorously  developed  by  FVijisald,  Kallianpur  and  Kunita  ([F— iif — iif)  1972)  in  their  seminal 
paper.  As  pointed  out  by  Nfitter,  the  di£5culty  with  this  approach  is  that  the  innovations 
process  is  not,  in  general,  explicitly  computable  (except  in  the  well-known  Kalman-Bucy 
case).  The  idea  of  using  estimation  algebras  to  construct  finite  dimensional  nonlinear 
filters  was  first  proposed  in  the  early  eighties  by  Brockett  and  Clark  [Br  —  Cl],  Brockett 
[Br]  and  Mitter  [Mi].  The  motivation  came  from  the  Wei-Norman  approach  [We- iVo]  of 
using  Lie  algebraic  ideas  to  solve  time  variant  linetu  differential  equations.  The  extension 
of  Wei-Norman ’s  approach  to  the  non-linear  filtering  problem  is  much  more  complicated. 
Instead  of  zm  ordinary  differential  eqtiation,  we  have  to  solve  the  Dimcan-Mortensen-Zakai 
(DMZ)  equation,  which  is  a  stochastic  partial  differential  equation.  By  working  on  the 
robust  form  of  the  DMZ  equation  we  can  reduce  the  complexity  of  the  problem  to  that 
of  solving  a  time  variant  partial  differential  equation.  Wong  in  [Wol]  constructed  some 
new  finite  dimensional  estimation  algebras  and  the  Wei-Norman  approach  to  synthesize 
finite  dimensional  filters.  However,  the  systems  considered  in  [Wol]  are  very  specific  and 
the  question  whether  the  Wei-Norman  approach  works  for  a  general  system  with  finite 
dimensional  estimation  algebra  remains  open. 

*  Research  supported  by  Army  Grant  DAAH-04-93G-0006 
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ReoenUy,  Tkm,  Woog  and  the  aeoond  author  [T-IT'-K]  have  examined  the  i»operties 
of  finite  dimenaonal  estimation  algdaras  and  the  Wei-Norman  approach  in  detail.  Tliere  a 
class  of  filtering  ^jrstems  having  the  property  that  the  drift  term«  /,  of  the  state  evolutioa 
equation  is  a  gradient  vector  fidd  was  considered.  In  [1^02],  the  concept  of  ft  is  introduced, 
whidi  is  defined  as  the  n  x  n  matrix  whose  (ij)-entry  is  In  view  of  Poincare 

lemma,  /  is  a  gradient  vector  field  if  and  only  if  ft  =:  0.  More  recently,  the  second  named 
author  [Ka]  considered  a  more  general  class  of  filtering  systems  having  the  property  that 
are  constant  for  all  i,j  i.e.  ft  is  a  skew  constant  matrix.  These  include  Kalman- 
Bucy  filtering  systems  and  Bene’s  filtering  systems  as  special  cases  and  finite  dimensional 
filters  were  constructed  explicitly.  FVom  Lie  algebraic  point  of  view,  Chen,  Chiou,  Leung 
and  the  second  named  author  [Ch  —  Va]  [C  —  L  —  Y]  have  shown  that  these  are  most 
general  finite  dimensional  filters  at  least  for  dimension  of  state  space  less  than  four.  In 
many  senses,  the  Lie  algebraic  viewpoint  has  been  remarkably  successful  and  the  recent 
work  has  given  us  a  deeper  tmderstanding  of  the  DMZ  equation  which  was  essentail  for 
progress  in  non-linear  filtering,  as  well  as  in  stochastic  control. 

However,  we  should  notice  that  the  Wei-Norman  approach  only  reduces  the  DMZ 
equation  to  a  finite  system  of  ordinary  differential  equations  and  the  following  gieneralized 
Kdmogorov  equatkm 


(lo)  = {5  i:(|-  -  -  Ktf  (*) + 

^  i*I  '  '  '»=!  '  •*!  ial  '  * 

u(0,  x)  s  (To 

/ 

It  is  the  purpose  of  this  paper  to  give  a  closed  form  formal  solution  of  the  above  equation. 


§  2.  Basic  concepts 

The  filtering  problem  considered  here  is  based  on  the  following  signal  observation 
model : 

(2  0^  /  <^^(0  =  Mt))dt  +  g(xit))dv{t)  i(0)  =  xo, 

\  dy{t)  =  h{x{t))dt  dw{t)  y(0)  =  0, 

in  which  x,u,  y,  and  w,  are  respectively,  i2",  i?"*  and  vedued  processes,  and  t;  and 

w  have  components  which  are  independent,  stamdard  Brownian  processes.  We  further 
assume  that  n  =  p\  f,h  are  smooth,  amd  that  y  is  am  orthogonatl  matrix.  We  will  refer 
to  i(<)  as  the  state  of  the  system  at  time  t  amd  to  y(#)  ats  the  observation  at  time  t. 

Let  p(t,x)  denotes  the  conditionad  probability  density  of  the  state  given  the  observa¬ 
tion  y(s)  :  0  <  s  <  t.  It  is  well  known  (see  [Da  —  Ma]  for  example)  that  p{t,x)  is  given 
by  normadizing  a  fimction,  (7(t,x),  which  satisfies  the  following  Duncam-Mortensen-Zakad 
equation; 


(2.1)  d<T{t,x)  =  Lo<Tit,x)dt  -i-  y^Li<T(t,x)dy^(t),  (t(0,x)  =  <to, 

i=I 
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where  U  =  \  ^  “  E"*!  lit  “  2  *  =  1, . . .  ,m,  li  is 

the  zero  degree  differential  operator  of  multiplication  by  hi.  oq  is  the  probability  density 
of  the  initial  point,  Xq. 

Equation  (2.1)  is  a  stochastic  partial  differential  equation.  In  real  applications,  we  are 
interested  in  constructing  robust  state  estimators  from  observed  sample  paths  with  some 
property  of  robustness.  Davis  in  [Da\  studied  this  problem  and  proposed  some  robust 
algorithms.  In  our  case,  his  basic  idea  reduces  to  defining  a  new  unnormalized  density 


m 

u(<,  i)  =  exp{-  x) 

1=1 


It  is  easy  to  show  that  u(t,  x)  satisfies  the  following  time  varying  partial  differential  equation 

(2.2) 

-^(t,  x)  =  Lou(t,  x)  +  5^  y,(t)lLo,  Jf)  +  2  S  y‘(Oy>(0[[-^o,  ii],  Lj]u{t,x), 


dt 

u(0,x)  =  <To 


i=l 


«.>=! 


where  [•,  •]  is  the  Lie  bracket  defined  as: 

Definition  :  If  X  and  Y  are  differential  operators,  the  Lie  bracket  of  X  and  Y  ,  [X,  Y], 
is  defined  by 

[X,Y]^^XiYO-Y{XO 


for  any  C°°  function 

In  §3,  we  shall  write  down  the  formal  solution  of  (2.2)  explicitly  in  closed  form. 


§  3.  Formal  solution  to  generalized  Kolmogorov  equation 

The  purpose  of  this  section  is  to  write  down  a  formal  solution  of  the  time  varying 
differentizil  equation  (2.2). 

Lemma  1  Equation  (2.2)  is  equiveJent  to  the  following  equation. 


(3.0) 


dt 


1=1  i=l 


i=i 

n 


«=1  ^  •  i=l  i=l 


u(0,l)  =<To(x) 
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Proof 


.Ha  '*  a  /  ^ 

[Io,A.(x))  =  i  -/,(x))’  -  (£  ^(x)  +  ^//(x)  +  £fc;(x)),/.i(x) 

>=i  ^  i=i  ^  >=1  J=1 

j=l  J 

1  ^  ^  ^  j:  /  \  /  \ 

=  2  S  +  E  -  E 


J=1  J 


^dXj^  'dx, 

J=1  •'  J=1  •' 


E  ax/^>axj  +  2  E  ax;  E/>(^)ax/^) 

h’'“)-'‘'l=E[ffH[itH 

£o  +  f^!,i(t)[Io,Iil  +  5EE9i(<)w(<)Po.Ii|,Iil 

1=1  ^  1=1  i=i 

1  ^  ^  ^  1  m  n  A*  A 

=  5  E  ^  -  E  -  E  - 1 E  ^?(^) + E  E 

^  i=i  ,=i  i=i  ^  i=i  ,=i  >*i 

1  in  n  ii2  Ik  ^  ^  AL 

+ 1 E  Ev-w^w  -  EEi'.w/^wf^w 

i=l  js=l  i  1=1  ji=l  ^ 

.  FH  ni  n  QL 

nEEE^.(‘)..(o[f^(x)][f^(x)] 

i=i  j=i  *=i  '  ' 

i=l  '  1=1  7  =  1  1=1  1=1 

1  FW  FFl  FI  ■ 

+  2  5Z  -  mi  yi(0/7(-p)  j-^(^) 

i=l  i=l  7  =  1 

.  FTl  FTI  FI  QL  Q  L 

+  |EEv.Wv,«)E[ftw][|^<^)l 


i=l  7  =  1 

1  Ar  5 


=  5E[^-N-Ev.«)|tw]]  ^Eltw-E^O^)] 

i=l  7=1  i=l  7=1  * 

-  2  E  “  E ''■’•‘'a^*"'']  “  E  aft*"''  ~  2  E  '*’  2  E  siCO^AiCi) 

-  EEv-««x)g(x) .  i  EEv.«)M.)E[g(x)]  [f^,x)] 


=  Itli-.-  - 1 1  - 1  E/^w 

lasl  jstl  isl  tsl 

11  m  ni  -  n  r  m  12  i  ”• 

+  E  E  - 1 1:  [E  v,(o|t(x)]  -  i  E  *?(x) 

-  E  E  v.«)/,(x)g(x) + i  E  E  .^(‘)v,(«)  E[it<^>] 

=  Itilr  [/.(^)  -  E^.wit(x)]]'  -  i  [E  f  (X) + E/^(x) + E.f(x)] 


•=i 
Q.E.D. 

We  observe  that  if  hi(x),  1  <  *  <  m  are  constants  in  the  robust  DMZ  equation  (3.0), 
then  (3.0)  becomes  the  Kolmogorov  equation 


du 


('->  =  {  5  E  -  5  g  t  J  + 1  *0  }“<'■ 


dt 

u(0,x)  =  <To 

In  generalized  Kolmogorov  equation  (1.0)  is  the  above  equation  which  we  let  hi  depend  on 
j;  again. 

Theorem  2  The  equation  (1.0)  has  a  formal  as3rmptotic  solution  on  /?**.  In  fact,  the 
solution  is  of  the  following  form 

(3.1)  u(t,x)=  f  •••  /  —^=—r^^'^exp(-'^{xj-(jf/2t)b{t,x,OMOd(i‘-d(n 

J-oo  J-oo  (v27r)'*  " 

where  6(t,i,0  = 

Here  at(x,^)  are  described  explicitly  as  follows.  Let 


(3.2) 


a(^.0=/ 

■'0  i=l 


Then 

(3.3)  ao(x,0=e“(*-^) 

Suppose  that  afc_i(x,^)  is  given.  Let 

‘\ef, 

i=I 


(3.4) 


i=l 


i=l  i=I  * 
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Then,  for  fc  >  1, 


a*(x,  0  =  f  +  t{x  -  0, 

Jo 


Proof  We  shall  prove  that  (3.1)  is  a  formal  solution  of  (1.0).  Putting  (3.1)  into  (1.0) 
we  have 


L.H.S.  of  (1,0)  =£ 


On  the  other  hand, 


r 

=  £  ■  •  •  II  E  [-f^.  -  WK.. 0 

~  oj  MOd^i  ’  din 

^L'  L  1  « -  ("•  -  «■'#<'■ 


(\/2^) 
5^6 


0 


■"/-»■■■  £  ■  ““K- 1  h  - 

dh 

—  tf i[x)  {^1  ^ii)  +  "T,  oj  <^0(0^0  ■  ■  ■  din 
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-  (Xi  -  0  +  (*.•  -  ^•)/i(a?)6(<,  X, 0  -  0 

-  (a^i  -  3;.  0  +  0  -  0 

-  tfi(x)-^(t,x,()  +  (Xj  -  ^i)fi(x)b(t,x,()  -  tfi(x)^(t,x,0 


+  tff{x)b{t,  X,  0]  MO<^i  -  d(n 
-  7-00  ■■■  7-00  2r-)[-T— 


-  2(xj  -  a:,  0  +  2(xj  -  (i)fi{x)b{t,  x,  0 

-  6(<,x,0  +  t^(t,x,()  -  t^{x)b(t,x,i)  -  2tfi{x)-^{t,x,() 

+  t/?(x)6(t,  X,  ^)]  <7o(0«^6  •  •  • 


i2.fr.5.  0/  (1.0) 


•£'£  Is  S<- “'■■<'■■■« 


+ 1 E  - 1 E  *■£)-'  E  *' « 

1=1  *  i=l  i=l 


,n  ^  ^ 

+  2  S  ”  2  ^  ■'■  ^  ^0 

»=1  i=l  *  1=1  i=l 


•  <^o{Od^i  •  •  •  d^n 

It  follows  that  (3.1)  is  a  solution  of  (1.0)  if 


x,0=-  0  +  -  ^i)fi{x)Ki^ 0 


i=l  *  1=1  * 

U^dfi 


1=1 

n  m 


5xi 


i=l  1=1  ^  ‘  <=1  «=1 
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i.e.  if 


-  ei)^(<.x,f) + ^(x.  - 

»«i  *  t=i 

i=i  •  i=i 

+  I  [-2  ^  ~  S 

t=i  '  t=i 

Put  b{t,x,^)  =  5!^*lo  (3-5).  We  have 

of  (3.5) 

=  -  E  E(*i  -  fi)^(*,  f  )<‘  +  E  E(»i  -  (i)fi(x)M^,  f )«‘ 


1=1  fc=0 


t=:l  ik=0 


+  EE|^(^.«‘‘^‘ -EE/<w^(-.«‘‘^' 

1=1  AsO  •  i=l  k=0  ^  * 


*=o  i=l  i=i 

oo  »  Q  oo  n 


fc=0  i=l 


fc=0  »=1 


+  EE|%^(^.«<‘-EE/-w^(^. 

fc=i  «=i  ‘  *=1  t=i 

-  E  (E  + 1 E  0*" 

*=i  1=1  *  «=i 

>»  n  n 

=  ~  E(^‘  “  ■•■  E^®*  “  ^«)/i(a^)ao(x,  0 

*  — 1  *  2 t 


dok-i 


OO  n 


+  E  “  E(^'  “  +  E^a:,  -  ^i)Mx)ak{x,^) 

t_i  .1—1  .  , 


*=1  i=l 
1  ^  d^cik-i 


«=i  *  i=i  * 

“(E^+ 1  E'‘i(*  ))•“-■(*’«)]' 


(a^.O 


I.ff.5.  o/ (3.5)  =J^kak{x,Ot' 
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Therefore  (3.1)  is  a  solution  of  (1.0)  if  the  following  (3.6)  and  (3.7)  are  satisfied 


(3.6) 


^(Xi  -  =]C(xi  -  (i)Mx)ao{x,() 


(3.7)  For  jfc>l 
(*  - 

i=l  i=l  * 


5  E  %'(-.<)  -  E/-w^(*.f)  -  (E  f  (-) + i  E 

i=l  *  i=l  i=l  *  f=sl 


^dfi 


Differentiate  (3.2)  with  respect  to  n 


=  -())<!*+ 1  E(^i-f>) 


dfj  $  +  f(a:  -  O] 


=  //.({+«(.-  o)^< + /'  E(x>  -  f,)  g  ft  ({+,(*  -  {)) 

9Uk+t(Xk  -  (k)) 

■y 

"  lo  /'  "  ^0““ 

=  j('  /i(f  +<(x  -  o)*  +  j(‘  *  E(Xj  -  +  ((x  -  ())dt 


(3.8)  ^(xi-«,)f:(x,«) 


n  -1  -1  n 

=  ~  /  /•  (^ + -o)di+  i Y^i^j  -  ijWj -  o) 

j=i  j=\ 


=  E(*t  - 
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Ut  ao(x,0  =  as  in  (3.3).  Then 


da 


i=l 

n 


=  -  ^i)/>(3f)ao(a:,  0  in  view  of  (3.8) 

i=i 


So  equation  (3.6)  is  satisfied.  Let 


«("’«  =  I  -  E/<w^(--«  -  5(E'‘?(*))“‘->(-.f) 


as  in  (3.4)  and  for  ib  >  1 


ak(x,0  -  6“^*’^^  jf  +t(x  - 

Then 

dak,  .  ' 

=  j('  (‘-V-«+-(-0.f)„((  +  ((:,  _ 

wAere  f(? +((i  -  {),{)  =  (?  +  <(*- O.f) 

+ /'  + *<">  - 
=  /'  (‘-'e-«+'<-«’«S»(<  +  <(x  -  «,«)* 
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=  jf  -  «)■<)* 

+  «•<'■<)  j('  J*  ^(li  -  (0^  (f  +  ‘(i  -  «,{)<« 

=  E(^i  -  «i)/i(i)<-*(i,{)  +  j['  + 1(»  -  £),£) 

n 

=  E(*<  -  0  +  ,  {) 

t=l 

-  +  t{x  -  0,  dt 

n 


t=i 


0  =  y^i^i  -  (i)fi(x)ok(x,  0  +  ffk(x,  0  -  kak(x,  0 

(*  -  =  sifc(*>0 


«S=1 


tsl 


So  equation  (3.7)  is  also  satisfied. 


Q.E.D. 

Lemma  3  Let  ao(x,0  =  1  and  dk-i(x,()  =  e~“^*’^^afc_i(x,^).  Let  gk(x,() 
e~'"^^’^^gk(x,0-  Then 


dk{x,0=  f  t'‘~'gk{(  +  Kx  -  OyOdt 

Jo 


t=l  *  1=1 

rly-»5^a,  ,  X  da  .  v^r/  \ 

i=l  *  i=l  t=l 

“  2  (£  ^^0  ~  S 


5afc_i 


where 


Proof: 

duk- 


df^ok-i 


'5^0  ,  da 


yfc(a;,0  =  e~“^*’^^fffc(x,0 

---«(*.0  ri  V^£*zi 


i=l  ‘  «=1 

1  ff®  W  * 

-  5(E'^?(^))‘'‘-(^'<)  -  E  |f^(^)“*-(^-«] 

i=l  i=l  * 


-  2  (S  (a;))aifc-i(x,  0  -  XI 

•  —^1  ^'^.1  ® 


A/da 


aajfc_, 


»=l  •  i=l  1=1  ‘  »=1 

-E^H«*->(^-« 


On  the  other  hand 


a*(x,0  =  e“(*’«  /  t‘-'e-“<«+*(*-«>’«>Sr4e  +  <(x-O,0<^< 
akix,0=  f  + 

Jo 


Q.E.D. 
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Theorem  4  The  equation  (1.0)  has  a  formal  solution  on  /Z”.  In  fact  the  solution  is  of 
the  following  form 

(3.9)  u(<,*)* 

II"  L  + *<*  -  «'»■") 

•  [1  +  ai(a:,  y)t  +  02(1,  y)<*  +  •  •  •  +  a*(ar,  y)<*  +  •  •  ■\(To{y)dyx  •  •  •  <iy„ 
where  a*(x,y)  =  t*~^sf*(y  +  t(x  -  y),y)di  and 
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NONLINEAR  PARTIAL  DIFFERENTIAL  EQUATIONS  OF  INTEREST 

IN  NONLINEAR  OPTICS* 


M.  J.  POTASEK 

Department  of  Applied  Pt^rsics 
Columbia  University 
New  York,  NY  10027 


ABSTRACT.  The  analysis  of  nonlinear  optical  phenomena  is 
important  because  of  the  wide  range  of  potential  applications  in 
various  fields  of  science  and  engineering.  Photonics  is  becoming 
increasingly  significant  and  research  in  this  area  is  of  vital 
importance  for  applications  ranging  from  optical  computing  to 
novel  light  sources.  It  has  been  recognized  that  solitons  may 
be  natural  bits  of  information  for  gating  and  switching.  While 
considerable  effort  has  focussed  on  the  picosecond  time  domain, 
research  is  now  evolving  in  the  femtosecond  time  domain.  We 
have  calculated  the  performance  of  the  first  femtosecond  soliton 
all-optical  switch. 


INTRODUCTION.  Because  of  their  rapid  response  time,  non¬ 
linear  optical  materials  are  gathering  considerable  interest. 

Much  of  this  research  has  involved  picosecond  or  greater  dxiration 
pulses.  However,  the  recent  development  of  femtosecond  light 
sources  in  the  visible  and  near  infrared  spectral  regions  makes 
possible  the  exploration  of  new  phenomena  on  ultrashort  time 
scales.  Research  into  the  fundamental  aspects  of  ultrashort  pulses 
is  of  interest  for  possible  femtosecond  soliton  lasers,  amplifiers, 
and  optical  computers.  Femtosecond  pulses  are  now  available  from 
a  number  of  optical  lasers,  such  as  the  NaCl  color  center  laser, 
mode-locked  Er  fiber  laser  and  the  colliding  pulse  mode- locked  semi¬ 
conductor  laser.  However,  mathematical  analysis  is  now  just  evolving 
for  this  rapidly  developing  field  and  has  not  kept  pace  with  experi¬ 
ments.  The  ^nphasis  on  shorter  pulses  requires  extent ion  beyond 
traditional  techniques.  Light  propagating  in  a  dielectric  medium 
is  given  by 


V^E  -  ®  (1) 

eoc  3r 


♦Supported  in  part  by  the  U.S.  Army  Research  Office. 
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v^«r«  E  is  the  electromagnetic  field  and  D  is  the  displacenwit  vector 
vAiich  is  expanded  in  a  power  series  in  E, 


(2) 


t  I  t 


+  8o  J  J  I  X3  E  (ti)  E  (4)  E  (*3) 


For  femtosecond  pulses,  one  obtains 


iqj+2qtt+  lql*q  +  Uqa+iejIqPqt +ie3q^q^ -eJqPq,  -  0 


(3) 


where  q  is  the  dimensionless  slowly  varying  envelope  of  the  electro- 
maagnetic  field,  the  subscripts  z  and  t  refer  to  differentiation  with 
respect  to  space  and  time,  respectively,  eund  the  coefficients  depends 
upon  various  dispersive  and  nonlinear  constants.  For  certain  physical 
conditions.  Eg.  (3)  reduces  to 


ia_+  la 


(4) 


The  figure  below  shows  the  propagation  of  the  soliton. 


FEMTOSECOND  ALL-OPTICAL  SWITCHES.  An  in^ortant  area  for  optical 
comoutincr  and  communications  is  intensity  dependent  optical  switching. 
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optical  switching  is  d^pmdant  on  tha  nonlinear  index  of  refraction  of 
optical  materials.  Device  configuration  consists  of  nonlinear  directional 
couplers  or  birefringent  couplers.  Experimentally  all-optical  logic 
devices  Including  an  inverter.  exclusive-OR  and  AND  logic  gates  were 
demonstrated  in  the  picosecond  time  doaain.  This  presentation  describes 
the  mathematical  results  for  novel  femtosecond  optical  switches. 

The  coiipled  equations  are  given  by 


(5) 

qu  +  3 ^q^qii+qi^iit^qi]  -  o 

+  ^^q2tn'^3(lqj^  Iqjl^)  3^qjqu+ qjq^i^qij  -  0 

The  equations  can  be  solved  by  the  inverse  scattering  transform 
method.  For  the  Nsl  soliton  one  obtains 

(7) 

qi  ■  ^  (*“®  -  cose  q(z.t) 

q2  ■  ^  +  cose  e”*^  *  q(zrf) 

(8) 


where 


q(W)  -  2ti  sech  pild  +  (4e(Ti* -3^^  +  2Q  z]  exp  -3q^ + z]  . 

p*§  +  in. 

The  angles  describe  the  relative  intensity  and  phase  of  the  input 
waves . 

A  figure  of  the  optical  switching  behavior  is  shown  below.  Part  (a) 
shows  the  output  switching  fraction  as  a  function  of  input  power, 
part  (b)  shows  the  optical  switching  behavior  at  low  input  intensity 


3?'^ 


and  part  (c)  shorn  tha  optical  switching  bahavior  at  high  ixiput 
intMsity. 


0») 


(c) 


CONCLUSIONS.  These  results  have  been  correlated  with  esqperimental 
parameters.  E:qperiments  are  planned  to  verify  the  concepts.  This 
would  be  the  first  demonstration  of  femtosecond  soliton  all-optical 
switching  which  could  advance  the  frontier  of  optical  devices  by 
several  orders  of  magnitude  and  constitute  a  breakthrough  of  the  physical 
understanding  of  high  data  rate  systans.. 
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Abstract: 

Beltrami  fields  yield  a  convenient  representation  of  the  electromagnetic  fields  in  a 
chiral  medium,  and  the  scattering  of  electromagnetic  waves  by  a  scatterer  immersed  in  a 
chiral  host  medium,  is  easily  tackled  using  Beltrami  fields.  In  this  work,  volume  integral 
equations  are  set  up  using  equivalent  Beltrami  current  sources  to  represent  a  chiral 
scatterer  in  a  chiral  medium.  These  equations  are  then  converted  into  algebraic  equations 
and  the  scattering  algorithms  obtained,  using  both  the  method  of  moments  (MOM)  and 
the  coupled  dipole  method  (CDM).  The  singular  behavior  of  the  dyadic  Green's  function 
for  Beltrami  fields  in  the  neighborhood  of  the  source  point  is  estimated  with  care,  and  the 
strong  and  the  xveak  forms  of  both  the  MOM  and  the  CDM  algorithms  are  derived. 

1.  Introduction: 

A  Beltrami  field  is  defined  as  one  that  is  proportional  to  its  curl  in  a  source- 
free  region.  Nature  provides  us  with  typical  examples;  hurricanes,  water  spouts 
and  vortex  flows  can  be  very  well  approximated  by  a  Beltrami  field.  In 
electromagnetic  theory,  Beltrami  fields  are  found  as  toroidal  and  poloidal  fields 
and  circularly  polarized  plane  waves.  Beltrami  fields  came  to  prominence  in 
electromagnetics  in  1974  due  to  Bohren  [1],  although  they  had  been  used  as  early 
as  1907  [2].  The  usefulness  and  application  of  Beltrami  fields  cannot  be  doubted, 
as  is  evident  from  [3-5]. 
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In  the  sequel  we  consider  scattering  by  a  chiral  inclusion  embedded  in  a  chiral 
host  medium.  The  electromagnetic  fields  that  are  incident  on  the  inclusion  <ire 
expressed  in  terms  of  Beiltrami  fields,  and  we  derive  expressions  for  the  scattered 
Beltrami  fields.  But  before  we  formulate  the  problem,  the  mathematical  concept 
of  Beltrami  fields  is  introduced  in  the  next  section.  In  section  3,  the  electric  and 
magnetic  fields  and  the  volume  current  densities  are  expressed  in  terms  of  their 
Beltrami  equivalents.  The  scatterer  is  then  replaced  by  equivalent  Beltrami 
current  densities  and  volume  integral  equations  are  obtained  for  scattering. 
Finally,  in  section  4  these  integral  equations  are  reduced  to  algebraic  ones  and 
two  algorithms,  the  method  of  moments  and  the  coupled  dipole  method,  are 
developed. 

2.  Beltrami  fields 

A  Beltrami  field  Q(r)  radiated  by  a  Beltrami  source  density  W(r)  satisfies  the 


relation 

VxQ(r)  +  KQ(r)  =  W(r),  (1) 

where  the  real  and  imaginary  part  of  k  are  of  the  same  sign  for  physical  validity. 
The  solution  to  such  an  equatioi^.  for  any  r ,  is 

QW  =  QtW  +  ffiv  G(r,r')*W(r’)  dV,  (2) 

where  is  the  solution  to  VxQ(r)  +  k  Q(r)  =  0  everywhere.  The  Green's 

dyadic  G(r,r')  for  the  Beltrami  field  satisfies  the  equation 

VxG(r,r')  +  k  GM  =  1 8(R),  R  =  r  -  r',  (3) 

where  8(R)  is  the  Dirac  delta  and  Hs  the  identity  dyadic.  The  Green's  function 
has  been  obtained  by  a  number  of  approaches  [6-8]  and  turns  out  to  be 

GM  =  -  (Id  +  VV/K  -  Vxl)  g(r,r'),  (4) 

where 

g(r,r')  =  exp(±iK|R|)/4jc|R|,  (5) 
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is  the  scalar  Green's  function  commonly  used  in  electromagnetics  and  acoustics- 
In  (5),  the  upper  sign  is  to  be  used  for  Re{K)  >  0  and  the  lower  for  Re{K}  <  0. 

Due  to  the  presence  of  the  VV  term  in  (4)  the  integrand  in  (2)  is  singular  at  R 
=  t-r'.  But  for  an  electrically  small  region  V,  whose  cross-sectional  dimensions 
are  much  smaller  than  the  wavelength  Ik/M,  the  held  at  the  source  point  can  be 
approximated  as  [9] 

Q(r')  =  Qb(r')  -  {K M  -  L/Kj^Win  r e  V,  (6a) 

where 

L  =  (1/4JC)  Jfav  [n'R/IRp]  d2r',  (6b) 

M=  (l/4jc)fflv([I-VxI/K]{ei’^«l/|R|} 

+  VV{(e‘'^*l  -  1)/k2|R|})  dV,  (6c) 

with  n'  being  the  unit  outward  normal  to  the  surface  dV  of  V  at  r'  6  dV. 

3.  Field  representation 

3.1  Chiral  host  medium 

With  these  preliminaries,  we  now  look  at  fields  in  a  chiral  medium.  In  the 
Drude-Born-Federov  representation,  the  firequency  domain  constitutive  relations 
for  a  chiral  medium  are 

D  =  e,E  +  EaP.VxE,  B  =  +  4,P,VxH,  (7a,b) 

where  and  are  the  permittivity  and  the  permeability  scalars,  respectively, 
and  Pg  is  chirality  parameter  of  the  host  medium.  An  exp[-i(ot]  time  is  implicitly 
assumed  throughout  this  analysis.  On  transforming  the  E  and  H  fields  into 
Beltrami  fields  [10],  the  Maxwell  curl  postulates  can  be  written  as 

VxQi  -  y,Q,  =  W„  VxQj + Y2  Q2  =  ^2'  (8a,b) 

with  the  Beltrami  fields  given  by 

Q,=  [l/2)(E  +  in.H),  (9a) 

Qj=  tl/2][H+iE/ii.],  (9b) 


393 


and  the  Beltrami  source  current  densities  by 

w,-  IVZkJBnJ-Kl,  (9c) 

Wj.  tY2/21tJH(l/il.)K+ JJ.  (9d) 

The  two  wavenumbers  in  the  host  medium  are  found  to  be 

Yj  =  k./(l-k,P.),  (10a) 

Y2  =  k./(l+k.P.),  (10b) 


where  k,  =  0)V(e,^iJ  is  used  for  convenience  in  notation;  and  =  V(jij/ ej  is  the 
impedance  of  the  chiral  medium.  It  is  assumed  here  that  Re{Yi/Y2}  >  0,  Im{Yj,y2}  > 
0  and  Re{ii,}  >  0;  and  it  is  also  noted  that  Q,  is  a  left-handed  field  and  Qj  is  a 
right-handed  field  [1, 11]. 

Thus,  to  solve  for  the  Beltrami  Helds  Qi(r)  and  Q2(r),  suitable  modiHcations 
of  (2)  yield 

Q.(f)  =  Q,b(f)  +  (Y,+ Yj)  fflv  S,(r.r').W,(r')  dV,  (11a) 

Q2(r)  =  Qji,(r)  -  (Y,+  Yt)  III,  9j(r.r').Wj(r')  dV,  (11b) 

where  V  now  is  a  region  not  necessarily  small  in  electrical  size,  while  the  dyadics 

Gi(r,r')  =  (Yi+ Y2)‘‘ G(r,r')|^^ 

=  (Yii  +  VV/Yi  +  V>^  {exp(iYilR|)/47c|R|}/(Yi+Y2)/  (12a) 

G2(r,r')  =  -  (Yi+  Yj)"!  G(r,r')|^^ 

=  (Y2I  +  ^^/Y2  -  V>d)  {exp(iY2lR|)/47t|R|}/(Yi+Y2)-  (12b) 

3.2  Equivalent  current  densities 

Equations  (lla,b)  are  volume  integral  equations  which  can  be  used  to  solve 
for  radiation  from  a  given  source  distribution.  If,  however,  we  consider 
scattering,  Qii,(r)  and  Q2h(*‘)  are  interpreted  as  the  fields  that  exist  in  the 
absence  of  the  scatterer  and  are,  therefore,  the  incident  fields.  Our  task  now  is  to 
replace  the  scatterer  by  current  densities  which  are  equivalent  to  it. 
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Let  all  space  be  divided  into  two  mutually  exclusive  regions  and 
which  represent  the  host  and  the  inclusion  media,  respectively.  The  region  is 
filled  with  chiral  matter  obeying  the  constitutive  relations 

D(r)  =  Et,  E(r)  +  ebfH,VxE(r),  r  e  (13a) 

B(r)  =  p,  H(r)  +  m.ft,VxH(r),  r  e  (13b) 

where  e,,  and  are  the  permittivity  and  the  permeability,  respectively,  and  is 
the  chirality  parameter  of  the  inclusion  material. 

The  source-free  Maxwell  postulates  can  be  written  everywhere  as 
VxE(r)  -  [k.2p.  E(r)  +  io)(i.  H(t)]/(l-k3.2) 

=  (-K^(r)  +  f  «  V„,+  V|„,  (14a) 


VxH(r)  -  tk.2p.  H(r)  -  i«)e.  E(r)]/(l-k.2p.2) 

=  II., W  +  i<iie.p.K^(r))/(l-k.2p,2), ,  a  .  (14b) 

Here. 

J^(r)  =  0,  reV^,,  (15a) 

K^(r)  =  0,  reV,,j,  (15b) 

J^ir)  =  io)(a^E(r)  +  a,,H(r)],  r  e  (15c) 

K^(r)  =  ia)[a^E(r)  +  au.H(r)],  r  €  Vj^,  (15d) 

and  the  quantities 


lo,2a  1  r  £3  e  ■ 

_i-VPb^  ■  i-VPa^J  [l-W  ■ 


(16a) 


'  _  4a  1  ^  1  r  VPh  _  K%  ' 

1-W  1-VPa'J  ia>Ll-VPb'  1-VPa'J' 


Ife— 

1-W. 


1  r  ' 

ia)[l-kb^Pb2 


(16b) 

(16c) 


l-ka2Pa2_ 


4b  _  4a 

1-KW  ■ 


(16d) 
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Knowing  the  electric  and  magnetic  current  densities,  and  using  (9c,d)  we  can 


set  down 

VxQ,(r)-Y,Q,(r)»  W,^(t),r€  V„,+ (17a) 
VxQj(r)+T2Qj(r)  =  Wj^(r),r€  (17b) 

where  the  equivalent  Beltrami  source  current  densities  are  specified  by 

Wi^(r)  =  0.r€  (18a) 

=  (18b) 

'^ieq(*')  =  [Yi/21cJ  [aj,Qi(r)  +  ai2Q2(r)],  r  e  (18c) 

'^2eq(*’)  =  [Y2/2kJ  [a2iQi(r)  +  a22Q2(r)I/  »  €  (18d) 

with 

an  =  i(o{(a.h  -  a^^)  +  i(Ti,-‘  a^,  +  q.  a J),  (19a) 

ai2  =  Ha^a^e-  a^, )  +  iTi,(a^^  +  a^e)},  (19b) 

a2i  =  ico{(a^-  a^,  /n^)  -  (iAi,)  (a^  +  a^ )},  (19c) 

a22  =  i(o  {(a^h  “  aj  -  i(Tl,‘  a^,  +  q,  a J}.  (19d) 


Thus,  through  (18a-d)  the  Beltrami  field  scattering  problem  has  been  altered  to  an 
equivalent  Beltrami  field  radiation  problem.  As  the  equivalent  current  densities 
are  identically  null  in  the  volume  integral  equations  for  Beltrami  fields  take 
the  form 

Q.(')  =  QiteW  +  ®Vi„,  [A,  ,(f,r')*Q,(t')]  dV 


(20a) 

QaW  =  Q2i«W  +  dSr- 

■^ffWA22{r,r').Q2(r')]d3r-,  re  V„,+  V^„ 

(20b) 

with  the  dyadic  kernels 

Ai,(r/)  -  a„  g/r,^ 

(21a) 

^12^-  ai2Gi(r,r'), 

(21b) 

^2i(*''*^)  “  ~  2  a2i  ^(r,!^). 

(21c) 
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^(ry)  =  -  *42  gjCry).  (21d) 

F6r  prescribed  incident  fields,  Qjj^  arul  (20a,b)  have  to  be  solved  for  the 
Beltrami  fields.  The  results  have  to  then  be  put  back  to  find  the  scattered  fields. 
To  solve  these  volume  integral  equations,  we  use  the  longwavelength 
approximation  to  rewrite  these  equations  in  algebraic  form  and  develop  two 
scattering  algorithms  -  the  method  of  moments  and  the  coupled  dip>ole  method'. 

4.1  Method  of  Moments  (MOM) 

In  1968  Harrington  [12]  introduced  into  electromagnetic  engineering 
community  the  term  "method  of  moments"  to  denote  an  approach  by  which  a 
linear  operator  equation  is  transformed  into  a  set  of  algebraic  equations.  Since 
then  more  sophisticated  versions  have  been  developed  [13,14],  but  the  original 
and  the  simplest  version  suffices  for  our  purposes. 

In  order  to  convert  the  volume  integral  equations  into  algebraic  ones,  the 
region  is  partitioned  into  simply  connected,  non-overlapping  subregions 
(m  =  1,2, ....,  M),  each  bounded  by  a  surface  on  which  a  unique  unit  outward 
normal  can  be  unambiguously  prescribed,  and  is,  therefore,  at  least  once- 
differentiable.  The  surface  of  the  subregion  is  assumed  to  be  convex,  and  the 
maximum  chord  of  the  subregion  is  assumed  small  compared  to  the 
wavelength  in  both  and  such  that  longwavelength  approximations  hold. 
As  a  consequence  of  the  longwavelength  approximation,  we  can  assume  Q](r)  = 
Qj(r^j)  and  Q2(r)  =  Q2(*'m)  sU  r  e  V^,,  where  r^j,  is  a  distinguished  point, 
generally  the  centroid  of  V^,.  Next,  the  approximate  evaluations 

/i^v„  [Gi(vr')*0(r')]  d^r'  =  u„  §j(r„,r„)*0(r„),  n  m,  (22a) 

[-L„  +  Tf,^,  J-KfrJ,  (22b) 

-  12 
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(22c) 


Jlfv„ [G2(r^r')*0(r')] dV  =  \)„  G2(r^r„)»0(rJ, n*m  , 

(G^»„y)*®(r')]  dV  s  (-L„  +  (22d) 

follow,  with  being  the  volume  of  V^,  and  d>  representing  either  Q^,  or  Qj  in  the 
subregion  of  integration  and  the  dyadics 

^  =  (1/4*)  /4v„  [n'R„/|Rn,l’l  dV,  (23a) 

M,„  =  (l/4it)  IfJv^  (Q  +  V>d/Y,|{e^'l*"l/|R„|} 

+  WKe^ilxJ  -  1)/y3R„I1)  dV,  (23b) 

(l/4it)If;v„(lI-^>l/rj](e*»l“d/H«„l) 

+  VV{(eWd  -  1)/Yj’|RJ|)  d V,  (23c) 


with  ^n,  =  and  n'  being  the  unit  outward  normal  to  the  surface  dV^  at  r'  e 
dV^.  We  observe  that  the  union  of  is  only  approximately  congruent  with 
in  practical  situations.  Also  to  be  noted  is  that  the  use  of  only  the  L  dyadic 
constitutes  the  "weak"  form  of  MOM  [9],  while  the  use  of  both  the  L  and  the  M 
dyadics  is  the  "strong"  form  thereof. 

With  these  approximations,  the  volume  integral  equations  can  be  converted 
into  6M  algebraic  equations  in  terms  of  the  cartesian  components  of  the  fields 
Qj(r„)  and  QjCr).  Thus, 

Qlinc(fm)  =  . M  (f  U.mn*Ql(«’n)  +  1  ^  m  <  M,  (24a) 

Q2inc(*‘in)  ~  ^n=l,2 M  iP21,inn*Ql(*’n)  ?224nn*Q2(01'  1  ^  ni  <  M,  (24b) 


where 

Iqsonn  =  -  <1  =  ^'2;  s  =  1,2;  m#n,  (25a) 

f  iidnm  =  I  -  [l/21^a]  [-L„  +  M^  Ja„,  (25b) 

®124nm  “  ~  [1  /2ka]  [“^m  ^  Yi^  ^lml®12 '  (25c) 

s21  jnm  ~  (1  /2ka]  MjmJajj ,  (25d) 

s  224nm  ”  ^  ^  /^kj  [~Ln,  +  Y2^  ^2mJ®22  ’  (25e) 
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The  matrix  equations  (24a,b)  can  be  solved  using  standard  matrix  methods 
provided  they  are  not  algorithmically  singular.  The  scattered  Beltrami  fields  can 
be  computed  through  (20a^)  as 
Qi.c.W  =  QiW-QiincW 

=  [Aii(r,r„)«Qi(r„)  +  Aj^Cr^J^OjCrJ],  r  e  V„,.  (26a) 

Q2K.W=Q2W~Q2iiic(*‘) 

AJl  Rotability  dyadics 

In  the  MOM  the  unknowns  are  the  fields  that  are  actually  present  at  the 
subregion.  For  the  development  of  the  coupled  dipole  method,  we  need  the 
fields  that  excite  the  m^  subregion.  The  field  exciting  is  the  sum  of  the  field 
that  is  incident  on  the  subregion  and  the  fields  that  are  scattered  by  the  other 
subregions.  In  other  words,  the  field  exciting  is  the  actual  field  in  the 
subregion  if  it  were  to  be  occupied  by  the  host  medium.  Thus,  by  definition 


Qlexc(*’in)  Qliiic(*'m)  ^d=l,2,...>l3»Mn  I®lljnn*Ql(*'n) 

+  ii2jnD*Q2(*‘n)]/ 1  ^  m  ^  M,  (27a) 

Q2exc(*'m)  ~  Q2iiic(*^m)  ~  (®21jnn*Ql(*^n) 

+  |22jnn*Q2(*’ii)I'  1  ^  m  ^  M,  (27b) 

so  that  (24a,b)  simplify  to 

Jll4nm*Ql(0  +  §12.mm*Q2(*‘m)  =  QlexcCO'  (28^) 

i21.mm*Ql(>’in)  +  |224nni*Q2('in)  =  Q2exc(*’m)-  (28b) 

Hence, 

*Qlexc(*in)  ^12.in*Q2exc(*’m)  ~  Ql(*’m)'  (28c) 

^21411*016x0(0  522411*020x0(0  “  02(0'  (28d) 

with 

^11411“  ~  (2l24iim*5224nin’**§2l4iiin"2ll4ninJ'*'  (2^®) 
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Sl2jn  *  t§21jiiiii  “  §22jiun*§12^''  •Sllonml’^' 
lSl2jmn”fll4iim*22lJ®®  **5224^ 

S224n  =  ■"  I52l4nm*§lljnm’^*Sl24iun  “  l22jninJ’*' 

In  keeping  with  Varadan  et  al.  [15],  rotpole  moments  may  be  defined  as 
t|„-(i/<i))u„W|^(r„),  (30a) 

-  (i/<»)u„Wj^(0,  (30b) 

these  rotpole  moments  serving  as  the  Beltrami  analogs  of  the  electric  dipole 
moment  p^^  and  the  magnetic  dipole  moment  m^^.  It  follows  from  (18c,d)  that 

‘im  =  (i/“)  lYi/2k«]  Uii  Qi(fJ  +  ai2Q2M'  (31a) 

t2„  =  (i/Q))  t)„  [Y2/2kJ  [aji  Qi(r J  +  a22  Q2(*‘m)l'  (31b) 

whence,  using  (28c,d), 

‘im  =  ^nan*QuJO  +  5l24n*Q2exc(0'  (32a) 

l2m  =  52l4n*Qlexc(0  +  S22^*Q2exc(»m)'  (32b) 

with  the  rotability  dyadics  given  as 

^llon  “  (^/®)  (Yi/2ka]  [aji  ai2C2jj„],  (32c) 

;i2.n=(i/®)^a.  [Yi/2kJ[a„Ci2^+a,292an^  (32cl) 

Jaijn ~  (i/®)  (Y2/2ka]  ^21  a22C2ij„l,  (32e) 

?22jn  “  (^/®)  la2i  Cjjjn  +  a22  ^22jn\-  (320 

4.3  The  coupled  dipole  method  (CDM) 

The  expressions  for  the  rotpole  moments  derived  in  the  preceding  section  can 


be  used  for  another  numerical  approach;  namely  the  coupled  dipole  method. 
This  method  has  been  variously  known  as  the  discrete  dipole  method  and  the 
Purcell-Pennypacker  approach  and,  as  the  third  name  suggests,  was  heuristically 
derived  by  Purcell  and  Pennypacker  in  1973  [16].  It  has  gained  ground  for  the 
study  of  scattering  in  recent  years  [17-19].  The  basis  of  this  method  lies  in 
assuming  that  each  subregion  can  be  thought  of  in  dipolar  terms.  The  total 
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scattering  can  then  be  calculated  by  summing  die  radiation  due  to  die  dipoles  or, 
as  in  this  case,  rotpoles. 

In  a  slighdy  diderent  form,  expressions  for  the  exciting  Beltrami  fields  (27ajb), 
can  be  written  as 

~  Qliiic(*in)  +  (Tr,'^2)2b=ia.  )l  (33a) 

Q2exc(0  =  Q2inc(0  “  (Y1+Y2)  k§2».*W2^(0}-  (33b) 

Substitution  for  the  equivalent  Beltrami  source  densities  by  the  rotpoles  leads  to 

Qlexc(^m)  ”  Qliiic(*'m)  “  MYi+Yi)  (^^) 

Q2exc(0  =  QlincCO  +  MYi+Yj)  (34b) 

Finally,  the  use  of  (32a,b)  )delds  the  CDM  equations 

Qliiic(*'iii)  “  l§ll^*Qlexc(*"n) 

+  §l24nn*Q2exc(*'n)l/ 1  ^  ^  M,  (35a) 

Q2iiic(*'m)  ~  ^11=1  A— .M  (P2l4nn*Qlexc(0 

+  S224nn*Q2exc(*‘n)l'  1  ^  m  S  M,  (35b) 
where  the  6M  unknowns  are  the  components  of  Qiexc(*’m)  Q2exc(*'m)' 


dyadics  involved  are  given  as 

Pllj«.  =  I?m.  +  '«>(l-««..)('ri'^2)gl«.‘5n^  (36») 

Pl2j«n=  ‘<»(l-S™XYin2)gl„*;i2j,  (36b) 

giijm  =  -i«i(l  -  S^XTi+Yj)  G2„m*52ij,  (36c) 

g22jmi  =  I?nm-i«>(l-S.„)(Yi+Y2)GaM*»!2j,-  (36d) 

The  scattered  fields  in  the  CDM  can  be  computed  after  the  calculated  excitation 
fields  have  been  used  to  determine  the  dipole  and  the  rotpole  moments.  Thus, 
QiscaC**)  =  -  i®(Yi+Y2)  {Gi(r,r„)*ti J,  r  €  (34a) 

Q2sca(*’)  =  io>(Yi+Y2)  2:„=i.2,.  ,M  {G2(i'.r„)*t2D}Y  >■  €  '^cxv  (34b) 
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5.  Concluding  Remarks 

To  summarize,  we  have  considered  scattering  of  Beltrami  fields  from  a  chiral 
inclusion  embedded  in  a  chiral  host.  In  our  treatment,  we  replaced  the  scatterer 
by  equivalent  current  densities  and  then  volume  integral  equations  were 
obtained  for  the  scattered  fields.  These  equations  were  then  reduced  to  algebraic 
equations  and  two  algorithms  -  method  of  moments  and  the  coupled  dipole 
method  -  were  developed.  These  equations  can  be  easily  solved  using  standard 
matrix  manipulations  provided  they  are  not  algorithmically  singular.  The 
expressions  derived  here  are  much  more  simpler  in  form  than  the  corresponding 
ones  for  the  electric  and  magnetic  fields.  It  is  also  important  to  note  that  the 
scattering  problem  is  solved  for  an  arbitrary  shape  of  the  inclusion.  Finally,  the 
the  problem  solved  in  this  paper  forms  a  part  of  Ref.  20  and  21. 
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1  Introduction. 

We  study  shock  waves  in  dispersive  semi-infinite  particle  chains  generated  by 
an  imposed  motion  on  the  first  particle.  We  assume  the  imposed  velocity  of 
the  first  particle  to  be  a  periodic  function  of  time  with  positive  mean  value. 
The  first  particle  moves  toward  the  others  compressing  the  chain.  For  a  chain 
with  nearest  neighbor  interactions  the  motion  is  described  by: 

(1.1a)  =  F(x„+1  -  In)  -  F(ln  “  ^n-l). 

71  Xj  3j  *  •  •  j 

The  initial  positions  and  velocities  are  given  by: 

(l.lb)  3;n(0)  —  0,  ®n(0)  ~  0  n=l,2, ..., 

and  the  forcing  velocity  is  given  by 

(1.1c)  xo{t)  =  2o  f{t)  t  >  0,  o  >  0, 
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where  Xn(t)  is  the  deviation  of  the  n*^  particle  from  an  initial  rest  position  and 
f(t)  is  a  zero-mean  periodic  function  of  time. 

We  first  outline  the  case  of  constant  forcing  (f(t)  =  0).  Holian  and  Straub 
[4]  took  various  force  laws  F(’)  associated  with  the  names  Lennard-Jones, 
Morse,  and  Toda,  and  investigated  the  limit  t  —*  oo.  Through  numerical 
experimentation,  they  not  only  confirmed  an  earlier  finding  of  von-Neumann 
[8]  that  the  shock  wave  is  reflected  from  the  origin  with  large  oscillations  be¬ 
hind  the  shock,  but  discovered  a  striking  new  phenomenon,  the  existence  of  a 
critical  shock  strength  OcHt  with  the  following  property: 

(i)  For  a  <  Cent  the  solution  of  the  shock  problem  is  oscillatory  in  the  region 
s^t  <n<  Sit  while  it  tends  to  zero  in  the  region  0  <  n  <  sjt,  as  t  tends 
to  infinity.  Here  si  is  the  shock  speed  and  $2  a  second  kind  of  speed; 
both  Si  and  sj  are  functions  of  a. 

(ii)  For  a  >  aerit  the  solution  is  oscillatory  everywhere  behind  the  shock,  i.e. 
for  0  <  n  <  Sit.  Yet,  the  quality  of  the  oscillations  is  different  in  the  inner 
region:  it  is  periodic  in  time  and  binary  in  space  i.e.  neighboring  particles 
are  moving  with  opposite  velocities  relative  to  the  forcing  particle  except 
in  the  region  of  the  zeroth  particle  itself  where  motion  is  somewhat  more 
complicated.  This  is  precisely  explained  in  [7]. 

The  existence  of  a  critical  forcing  velocity  is  not  yet  generally  understood. 
In  the  special  case  of  the  Toda  chain  (also  referred  to  as  the  Toda  lattice)  which 
has  exponential  force  law  F{r)  =  — e"’*,  Holian,  Flaschka  and  McLaughlin  [3] 
utilized  complete  integrabUity  to  analyze  the  von-Neumann  problem  and  de¬ 
rived  the  critical  speed  a  =  1  and  the  shock  speed  si(a).  Indeed,  by  translating 
the  coordinates  so  that  the  particle  xq  remains  fixed  and  reflecting  the  chain 
about  this  particle,  one  imbeds  the  von-Neumann  problem,  which  is  nonau- 
tonomous,into  an  autonomous  initial  value  problem  for  a  doubly  infinite  chain 
(— oo  <  n  <  oo).  The  particles  in  this  chain  still  satisfy  equations  (l.la). 
They  have  zero  initial  deviations  and  have  initial  velocities  given  by  —2a  sgn 
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where  n  is  the  particle  index.  In  the  case  of  the  Toda  chain  the  new  problem 
is  completely  integrable. 

In  the  integrable  problem,  criticality  arises  from  the  fact  that  the  continuous 
spectrum  of  the  Lax  operator,  given  by  the  set  {-o-l,-a  +  l]u[a-l,o  +  l} 
consists  of  a  single  interval  in  the  subcritical  case  0  <  a  <  1  and  of  two  disjoint 
intervals  in  the  supercritical  case  a  >  1. 

Venakides,  Deift  and  Oba  [7]  studied  the  supercritical  von-Neumann  prob¬ 
lem  for  the  Toda  chain  by  analyzing  the  long  time  behavior  of  the  r-function  of 
the  imbedding  integrable  problem  on  —  oo  <  n  <  oo.  They  showed  the  emer¬ 
gence  of  oscillations  whose  structure  they  obtained  in  detail  and  derived  the 
speeds  si  and  S2-  (They  denote  these  speeds  by  N^ax  and  Nmin  respectively.) 
Using  the  same  technique,  Kamvissis  [5]  derived  the  oscillatory  structure  in 
the  critical  and  subcritical  case. 

The  striking  result  in  [7]  and  [5]  is  th:  ■  ^he  residual  state  of  the  chain  i.e.  the 
state  of  motion  in  the  region  |n|  <  which  each  particle  eventually  enters, 
is  described  by  an  algebraic/geometric  type  solution  whose  Lax  operator  has 
exactly  the  same  spectrum  as  the  Lax  operator  of  the  original  integrable  shock 
problem.  In  other  words,  (a)  the  problem  remains  isospectral  even  after  a 
spatially  non-uniform  limiting  process  {t  oo)  eliminates  the  region  s^t  <  |n| 
by  pushing  it  to  infinity  and  (b)  in  the  residual  state,  there  are  no  more  degrees 
of  freedom  except  those  dictated  by  the  spectrum:  the  number  of  degrees  of 
freedom  equals  the  number  of  spectral  gaps  which  in  this  case  is  effectively 
equal  to  one. 

When  the  forcing  velocity  f{t)  is  no  longer  constant,  numerical  experiments 
(see  below)  indicate  that  the  residual  state  is  again  described,  in  the  region 
n  <  const. t,  by  an  algebraic/geometric  type  solution,  but  now  the  number  of 
gaps,  cmd  hence  the  number  of  degrees  of  freedom,  may  be  greater  than  one. 

Unfortunately,  the  process  of  imbedding  the  semi-infinite  chain  into  an 
integrable  chain  does  not  work  when  the  forcing  velocity  is  time  dependent. 
Although  the  problem  may  still  be  integrable,  as  one  might  be  led  to  suspect 
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by  the  recent  work  of  Fokas  and  Its  [FI]  on  initial- boundary  value  integrable 
problems,  no  linearizing  transformation  has  yet  been  found.  Faced  with  the 
inability  to  reflect  the  problem  we  ask  ourselves:  How  does  the  spectrum  of  the 
residual  state  manifest  itself  in  the  original  forced  semi-infinite  Toda  chain? 

The  evolution  of  the  Lax  operator  of  the  semi-infinite  chain  is  not  isospec- 
tral;  the  Lax-pair  equation  has  a  rank-one  perturbing  term  which  contains  the 
forcing.  The  initial  (t  =  0)  continuous  spectrum  of  the  Lax-operator  for  the 
semi-infinite  chain  consists  of  the  single  interval  [a— 1,  a+lj.  The  spectral  evolu¬ 
tion,  studied  numerically,  is  as  follows.  When  the  forcing  velocity  is  constant, 
eigenvalues  are  emitted  from  the  lowest  point  of  the  continuous  spectrum  at  a 
constant  rate  and  move  to  the  left  filling  the  band  [—a  —  1,  —a+ 1]  (actually  the 
band  [—a  —  1,  a  —  1]  when  a  <  1)  in  the  limit  <  — »  oo,  which  we  also  consider  as 
continuous  spectrum.  The  limiting  “continuous”  spectrum  of  the  Lax  operator 
is  identical  to  the  continuous  spectrum  of  the  residual  state.  The  emission 
of  eigenvalues  from  the  continuous  spectrum  was  first  observed  by  Kaup  and 
Neuberger  [6],  who  make  an  approximate  calculation  of  the  eigenvalue  birth 
rate.When  the  forcing  velocity  contains  a  periodic  component  (/  ^  0),  the 
emitted  eigenvalues  may  fill  more  than  one  band  corresponding  to  a  residual 
state  that  is  a  multiphase  wave. 

In  our  analysis  of  the  periodically  forced  problem  we  achieve  the  following: 

(a)  We  derive  a  closed  system  for  the  evolution  of  the  scattering  data,  includ¬ 

ing  the  evolution  of  eigenvalues. 

(b)  We  mahe  the  Ansatz  that  the  eigenvalues  asymptotically  cluster  onto  a 
finite  set  of  bands.  (This  is  confirmed  by  numerical  experiment  in  the 
subcritical  case.)  We  then  take  the  continuum  limit  of  the  eigenvalue 
evolution  equations  to  derive  an  integral  equation  for  the  asymptotic 
(t  — »  oo)  spectral  density  of  these  bands.  We  cannot  yet  determine  the 
number,  and  the  endpoints  of  the  bands  (if  2g  numbers  are  needed  we 
only  have  g  relations).  However  if  we  assume  the  endpoints  given,  then 


408 


we  can  correctly  calculate  the  corresponding  spectral  densities. 

(c)  In  the  snbcritical  case  of  the  Toda  chain,  and  when  the  periodic  com¬ 
ponent  of  the  forcing  is  small,  we  rigorously  construct,  imder  minimal 
assumptions,  solutions  to  the  chain  equations  that  are  valid  for  n  >  0 
and  — oo  <  t  <  oo  and  in  which  the  velocity  of  the  zeroth  particle  is  a 
given  time-periodic  function.  The  solutions  are  multiphase  waves  away 
from  the  boundary  and  they  connect  to  the  forcing  function  through  a 
boundary  layer.  When  the  frequency  exceeds  a  threshold  value,  there  is 
no  wave  penetration  into  the  chain,  there  is  only  a  boundary  layer  that 
decays  exponentially  vdth  the  particle  index.  As  the  frequency  decreases, 
more  phases  are  generated  according  to  a  precise  formula.  If  the  number 
of  the  wave-phases  is  g,  the  location  of  the  midpoints  of  the  g  spectral 
gaps  is  determined  by  the  frequency  of  the  driver.  Their  widths  as  well  as 
the  corresponding  phase  shifts  (2^  pieces  of  information)  are  determined 
by  the  first  g  Fourier  coefficients  of  the  driver  {2g  pieces  of  information, 
since  these  coefficients  are  in  general  complex).  The  remaining  part  of 
the  information,  i.e.  the  remaining  Fourier  coefficients  of  the  driver  deter 
mine  the  boundary  layer  The  calculation  involves  a  Liapunov  Schmidt 
decomposition,  the  definition  of  an  appropriate  family  of  norms  and  the 
use  of  the  implicit  function  theorem.  We  can  carry  through  the  above 
calculation  for  a  general  nonintegrable  chain,  if  we  restrict  ourselves  to 
single  phase  waves.  This  is  equivalent  to  requiring  that  the  frequency  of 
the  driver  be  sufficiently  large.  We  have  not  yet  been  able  to  construct 
ge  leral  nonintegrable  chain  multiphase  waves  due  to  problems  with  small 
divisors. 

In  this  presentation,  we  focus  on  the  case  in  which  the  periodic  component 
of  the  driver  is  not  small,  i.e.  as  described  in  points  (a)  and  (b)  above. 
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2  The  Evolution  Equations 

We  use  the  Flaschka  variables 

(2.1)  0.  =  -^,  6,  =  ieC-— +•)/*  ,n  =  0,l,2,... 

and  we  note  that  the  function  ao(f)  is  the  given  time-periodic  forcing  function. 
Equations  (1.1a)  for  the  semi-infinite  chain  with  F{r)  =  e~'  (Toda)  are  easily 
reduced  to  the  perturbed  Lax  pair  equation: 

(2.2)  ^  +  =  -p{t)P 


where  M  is  the  tridiagonal  matrix 


M  = 


bi  02  b2  0 
h2  03 

0  •  . 


B  is  the  antisymmetric  tridiagonal  matrix  given  by 


B  = 


0  61 

—61  0  62 

0  — 61  0  62 


P  is  the  rank-one  matrix  given  by: 


1  if  i  =  j  =  l 
0  otherwise 


and  p(t)  is  the  function: 


p{t)  =  2bl{t)  =  26f(«)  -  di(t),  • 


£ 

dt 


Strictly  speaking,  the  matrices  M  and  B  are  semi-infinite.  However,  trun¬ 
cating  the  chain  at  some  particle  of  very  large  index  N  makes  the  matrices  M 
and  B  finite.  The  disturbance  in  the  chain  caused  by  the  truncation,  travels 
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essentially  with  finite  velocity.  Only  exponentially  small  effects  display  infinite 
speed.  Thus,  a  large  part  of  the  chain,  say  the  left  half,  does  not  essentially 
feel  the  truncation  until  a  time  t  =  0{N).  Our  analysis  is  in  the  regime 
1  « t  «  N  and  n  «  N  of  &  finite  chain. 

Our  strategy  is  to  derive  evolution  equations  for: 

(a)  the  eigenvalues  A^-  of  the  truncated  matrix  M . 

(b)  The  first  entry  fj  of  the  j*^  eigenvector  of  M(j  =  1, . . .  ,iV)  when  it  is 
normalized  to  have  Euclidean  length  equal  to  one. 

Given  the  N  eigenvalues  Aj  and  the  first  eigenvector  entries  /j,  we  can 
reconstruct  the  matrix  M  [7]. 

Theorem  2.3.  The  evolution  of  the  A^’s  and  fj's  is  given  by: 

ln(-A,)  =  Aj  -  ao(t)  +  =  1, . . . ,  iV. 

/?  = 

I  •'J  p 

where  p  —  2hl^{t)  =  —  Sili  Ki  ^  do*  indicates  a  derivative  with  respect  to 
time. 

The  initial  values  Aj(0)  are  the  eigenvalues  of  Af  at  t  —  0  while  the  initial 
values  Aj(0)  are  given  by 

X,(0)  =  -2^(0)/i'(0). 

Proof:  Let  A  be  the  diagonal  matrix  of  the  eigenvalues  Xj  of  M  and  let  $  be 
the  matrix  whose  column  is  the  normalized  eigenvector  of  M  corresponding 
to  the  eigenvalue  Xj.  We  have: 

(2.4a) 

Let 

(2.4b)  $  =  ^  where  •  = 
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utilizing  equations  (2.4a)  and  (2.4b),  we  easily  calculate: 

(2.4c)  Mi  -ih.  =  ik  +  pPi. 

We  now  define  the  matrix  A  =  (a,j)  by  the  relation: 

(2.4d)  i  =  iA. 

We  calculate  {A^  is  the  transpose  of  A): 

A^-¥A  = 

at 

The  last  equality  is  true  because  the  matrix  is  orthogonal.  Thus 

(2.5)  A^  +  A  =  0; 
i.e.  A  is  antisymmetric.  Using  (2.4c)  we  obtain 

(2.6)  M#  -  =  Mi  A  ~  iAA  =  «'(AA  -  AA). 

Compariog  (2.4c)  with  (2.6)  we  obtain  easily: 

(2.7)  A  =  [A,A]-p'f^F'f 

Let  =  (/i,  /2, . . . ,  /at)  be  the  first  row  of  Then 

i^Pi  =  ff 

where  the  righthand  side  is  a  matrix  product.We  insert  this  in  (2.7). 

(2.8)  A  =  [A,A]-p//^. 

Equating  the  diagonal  elements  on  both  sides  we  obtain: 
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(2.9) 


Ai  =  -0//. 


~  P' 


This  proves  the  second  relation  in  Theorem  2.3. 

Off  the  diagonal  in  (2.8)  we  have  for  i  ^  j:  AjOjj  —  OijXj  =  pfifj,  hence: 

(2.10)  Oij  =  when  1 9^  j; 

Om  =  0  by  (2.5).  We  now  calculate  the  evolution  of  fj.  By  (2.4b): 

*  =  ♦  +  B'®  =  'i'A  + 

Specializing  this  to  the  first  row  we  obtain:  =  f^A  +  B/j,  ®  where  Bji,  is 

the  first  row  of  B. 

Taking  transposes  and  factoring  we  obtain: 

(2.11)  f  =  {A-aiI-  A)f. 

We  write  the  antisymmetric  matrix  A  obtained  in  (2.10)  as 

(2.12)  A  =  pFLF 

where  F  is  the  diagonal  matrix  with  entries  /i,.-.,/iv  and  L  is  the  matrix 
(v^)-  obtain  /  =  (A  -  oj/  —  pFLF)F. 

(2.13)  F-^f  =  F~^Af  -  aiF-^f  -  pLFf. 


We  then  remark  that: 


(i)  pFf  =  p 


by  (2.9), 
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(ii)  F-^S  = 


1 

1 


,  F-^A  =  AF-». 


Substituting  in  (2.13)  we  obtain 
(2.14)  ^  =  A,-  -  oi  +  x^- 

The  evolution  of  the  Aj’s  in  (2.3)  is  finally  obtained  by  eliminating  fj  be¬ 
tween  (2.14)  and  (2.9)  and  using  the  expression  for  p  given  in  (2.2). 

3  Numerical  Results 

In  Figures  la  and  lb  respectively,  we  show  the  position  and  velocity  profile  of 
the  chain  when  a  constant  subcritical  velocity  x{t)  =  xo{t)  =  1  is  imposed  on 
the  leading  particle.  In  Figures  2a  and  2b,  we  display  the  same  information 
but  now  for  a  supercritical  driver  x{t)  =  xo(0  =  4.  In  this  case  one  observes 
residual  binary  oscillations  in  a  neighborhood  of  the  forcing  particle.  The 
length  (^number  of  particles)  of  the  chain  over  which  oscillations  occur  is 
given  asymptotically  by  sit  while  the  length  of  the  chain  that  displays  binary 
oscillations  equals  sst.  Both  speeds  si  and  $2  are  calculated  in  [7]. 

In  Figures  3  through  8  we  present  the  asymptotic  (large  t)  position  profile 
and  the  spectral  profile  of  the  lattice  when  a  zero-mean,  time-periodic  pertur¬ 
bation  is  added  to  a  subcritical  forcing  velocity.  We  vary  the  frequency  and  the 
amplitude.  We  have  chosen  three  frequencies  and  two  amplitudes.  We  label 
the  latter  as  ’’large  amplitude”  and  ’’small  amplitude”. 

We  observe  that  when  the  frequency  is  large  enough,  the  oscillations  do  not 
penetrate  the  chain  (see  Figures  3a  and  4a).  The  continuous  spectrum  emits 
eigenvalues  that  fill  up  the  interval  (-1.5, -0.5)  as  is  seen  in  Figures  3b  and 
4b.  Asymptotically  the  spectrum  changes  from  the  interval  (-0.5, 1.5)  to  the 
interval  (-1.5, 1.5).  The  new  spectrum  corresponds  to  the  residual  compressed 
state  of  the  chain  that  results  from  the  constant  part  of  the  forcing. 
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The  phenomenon  of  non-penetration  of  the  chain  high-frequencies  is 
easily  understood  in  the  case  of  a  linear  chain  where  the  threshold  frequency 
can  be  directly  calculated.  The  phenomenon  persists  in  the  nonlinear  case. 
When  the  amplitude  of  the  driver  is  small,  there  is  again  a  threshold  frequency 
which  we  can  calculate.  For  larger  amplitudes  and  for  forcing  velocities  of  the 
form  xo(t)  =  2a+/(u>t),  where  /  is  2s’-periodic  and  u;  >  0,  it  is  not  yet  possible 
to  compute  the  threshold  frequency  u  =  uq  below  which  penetration  occurs. 
In  this  large  amplitude  situation,  uq  will  depend  on  the  shape  of  /. 

When  the  forcing  frequency  and  waveshape  are  such  that  oscillations  begin 
to  arise  in  the  chain,  a  dramatic  change  occurs  in  the  spectrum.  The  emitted 
eigenvalues  cluster  in  bands  as  is  seen  in  Figures  5b,  6b,  7b,  and  8b.  If  the  driver 
is  of  the  form  Xo(f )  =  2o  -I-  c/(f),  0  <  o  <  1,  c  small,  we  can  verify  numerically 
that  the  residual  state  of  the  chain  at  t  =  oo  is  described  precisely  by  solutions 
of  the  type  constructed  in  section  1(c),  which,  therefore,  constitute  the  set  of 
attractors  for  the  forced  chain  (1.1).  We  believe,  consistent  with  the  numerical 
evidence,  that  the  same  situation  occurs  when  e  is  large.  However,  we  have  not 
yet  been  able  to  construct  solutions  analogous  to  the  ones  of  section  1(c)  in  this 
case.  The  technical  difficulty  lies  in  connecting  the  boundary  forcing  at  n  =  0, 
through  a  boundary  layer,  to  the  j^-gap  algebraic/geometric  solution  at  large  n. 
We  note  again  that  as  the  frequency  of  the  driver  decreases,  more  wave-phases 
are  activated  and  correspondingly  more  gaps  appear  in  the  spectrum. 

In  Figures  5c,  and  7c,  we  plot  the  (integrated)  spectral  density,  i.e.  the 
number  of  eigenvalues  below  A  (=spectral  parameter),  divided  by  time.  In 
Figures  5d,  and  7d,  we  plot  the  same  densities  as  predicted  by  our  theory  that 
is  based  on  taking  the  continuum  limit  of  the  eigenvalue  evolution  equations  of 
Theorem  2.3.  Our  theory  is  not  yet  complete.  We  cannot  predict  the  endpoints 
of  the  band/gap  spectral  structure.  To  derive  the  predicted  densities  we  had 
to  use  the  numerically  obtained  values  for  the  endpoints. 


415 


•  I  ♦  0^  iWIt)  ♦0.1  «a«m  M.1. 1 


Figure  8(a) 


Figure  8(b) 


4  The  Continuum  Limit 

We  outline  our  main  result  in  tcLking  the  continuum  limit  of  the  eigenvalue 
evolution  equations  of  Theorem  2.3.  It  is  clear  numerically  that  eigenvalues  are 
emitted  from  the  continuous  spectrum  at  a  rate  that  is  constant  if  averaged  over 
a  large  period  of  time.  It  therefore  is  assumed  that  the  number  of  eigenvalues  in 
an  interval  grows  linearly  with  time.  The  density  function  ip{X)  is  defined  as  the 
difference  between  the  asymptotic  density  of  eignvalues  at  time  t  and  at  time 
zero  normalized  by  dividing  by  the  time  t.  In  other  words,  the  asymptotic 
(large  t)  number  of  eigenvalues  in  an  interval  (Ai,  A2)  minus  the  number  of 
eigenvalues  in  the  same  interval  at  time  t  =  0  is  given  to  leading  order  by 
the  integral  of  V'  over  the  interval,  multiplied  by  t.  We  derive  the  following 
continuum  limit  of  the  eigenvalue  dynamics: 

(4.1)  A—  <  ao  >  —  ^  /^log|A  —  =  0  when  A  is  in  B, 

V'(A)  =  0  when  A  is  not  in  B. 


(4.2)  rooV'(A)dA  =  0 


(conservation  of  eigenvalues). 


In  these  formulae  <  oq  >  is  the  mean  value  of  the  given  velocity  ao(t)  and 
the  set  ^  is  the  support  of  the  density  function  The  set  B  is  one  of  the 
principal  unknowns  of  the  problem.  The  density  function  ^  is  required  to  be 
nonnegative  in  regions  in  which  the  original  {t  =  0)  density  of  eigenvalues  is 
zero. 

We  utilize  the  fact  that  the  derivative  of  the  integral  kernel  in  (4.1)  with 
respect  to  A  is  the  integral  kernel  of  the  Hilbert  transform,  to  reduce  these 
equations  to  a  Riemann- Hilbert  problem,  which  we  can  solve  if  we  have  some 
additional  information  on  the  set  B.  This  additional  information  could  be 
gained  if  we  could  keep  a  higher  order  term  as  we  pass  to  the  continuum 
limit  in  our  derivation  of  equations  (4.1)  and  (4.2),  which  we  have  not  yet 
succeeded  indoing.  However,  once  B  is  known  from  the  numerical  experiments, 
we  can  solve  the  system  (4.1)-(4.2)  for  Integrating  we  obtain  the  spectral 
densities  of  Figures  5d  and  7d,  which  compare  favorably  with  the  spectral 
densities  computed  numerically  directly  from  the  data  and  displayed  in  Figures 
5c  and  7c. 
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Asymptotic  Model  for  Deflagration-to-Detonation 
Transition  in  Reactive  Two-Phase  Flow 

Pedro  F.  Embid* 


Abstract 

The  Deflagration-to-Detonation  Transition  (DDT)  in  energetic  granular  materials 
is  a  complex  multiphase  phenomena  involving  many  thermal  and  mechanical  effects 
among  the  solid  and  gas  phases.  Baer  and  Nunziato  introduced  a  continuum  system 
of  equations  in  their  study  of  DDT  in  reactive  two-phase  flows.  This  system  becomes 
resonant  when  the  compaction  wave  speed  of  the  solid  equals  one  of  the  gas-acoustic 
wave  speeds.  Combining  high-energy  activation  energy  asymptotics  with  nonlinear 
geometrical  optics,  we  develop  an  asymptotic  model  for  the  resonant  wave  interaction 
of  a  fast  moving  burning  front  and  one  of  the  gas-acoustic  waves.  This  model  is 
capable  of  predicting  in  qualitative  fashion  several  of  the  scenarios  documented  for 
DDT  through  large  scale  computations.  In  addition,  the  varios  scenarios  occur  for 
different  choices  of  parameters  in  the  asymptotic  model,  which  have  direct  physical 
interpretation  in  terms  of  tlie  asymptotic  procedure. 


1  Introduction 

Understanding  the  process  of  Deflagration-to-Detonalion  Transition  (DDT)  is  a  key  safety 
issue  for  the  industry  and  the  military  [4].  The  problem  is  particularly  complex  in  reactive 
multi-phase  media.  However,  there  are  several  identifiable  stages  in  DDT  [1,  4].  In  the 
initial  conductive  stage  the  flame  is  propagated  mainly  by  heat  diffusion.  Next  there  is  the 
convective  stage  where  the  flame  rapidly  grows  and  accelerates  through  heat  and  momentum 
transfer  between  the  solid  reactant  and  the  hot  product  gases.  This  is  followed  by  a  com¬ 
pressive  stage  characterized  by  further  acceleration  and  the  development  of  a  shock  in  the 
solid  phase.  Finally,  the  wave  continues  to  grow  to  a  fully  developed  detonation  wave.  This 
description  indicates  that  heat  and  momentum  transfer  between  the  solid  and  gas  phases  are 
important  mechanisms  in  DDT.  Among  these  transfer  effects  one  has  the  preheating  of  the 
solid  explosive  by  the  generated  hot  gases,  the  formation  of  zones  of  high  compaction  in  the 
solid  and  choking  of  the  gas  flow  in  the  convective  stage,  and  load  transfer  leading  to  shock 
formation  in  the  solid  in  the  compressive  stage. 

Recently,  Baer,  Nunziato,  and  their  collaborators  [1,  2,  3]  developed  a  complex  system 
of  equations  for  describing  transition  to  detonation  in  reactive  granular  materials.  Their 
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approach  involves  the  rational  continuum  theory  of  mixtures  and  a  systematic  use  of  the 
second  law  of  thermodynamics.  The  resulting  system  of  nonlinear  equations  involves  fluid 
mechanics  for  the  gaseous  phase  and  solid  mechanics  for  the  grains  coupled  together  through 
an  equation  for  the  volume  fraction  of  the  solid  as  well  as  source  terms  involving  the  bulk 
effects  of  surface  chemistry  and  compaction. 

Numerical  integration  of  this  system  of  equations  yields  excellent  qualitative  and  quan¬ 
titative  agreement  with  the  available  experimental  data  for  various  types  of  secondary  ex¬ 
plosives,  such  as  CP  and  HMX.  This  data  involves  the  trajectory  of  the  flame  front,  run 
distance  to  detonation  and  the  time  to  detonation  for  various  inital  values  of  the  precom¬ 
pacted  granular  bed  [1,  2,  3]. 

However,  an  important  theoretical  problem  consists  in  identifying  key  physical  mecha¬ 
nisms  that  are  responsible  for  the  formation  of  hot  spots  and  the  transition  to  detonation  in 
multi-phase  flows.  Rather  than  attempt  a  direct  study  of  the  formidable  system  for  reaw:tive 
multi-phase  flow,  we  develop  and  analyze  a  simplified  asymptotic  system,  valid  near  special 
resonant  flow  states  to  be  described  below.  We  then  show  that  this  asymptotic  model  can 
reproduce  in  qualitative  fashion  many  of  the  scenarios  of  DDT  documented  for  the  multi¬ 
phase  flow  system  through  the  use  of  large  scale  computing  [1,  2,  3].  This  paper  reviews  some 
of  the  results  I  obtained  in  collaboration  with  A.  Majda,  M.  Btter,  and  J.  Hunter  [6,  7,  8], 
and  it  is  organized  as  follows.  In  Section  2  we  present  briefly  the  characteristics  analysis 
for  the  reactive  multi-phase  flow  equations  of  Baer  and  Nunziato  and  exhibit  the  singular 
resonant  points.  In  Section  3  wc  discuss  the  asymptotic  model  for  hot  spot  formation  valid 
near  resonant  points.  Finally,  in  Section  4  we  analyze  within  the  context  of  the  tisymptotic 
model,  the  nonlinear  mechanisms  for  the  development  of  hot  spots. 


2  Characteristics  analysis  of  the  multi-phase  system 
and  resonant  points 

The  reactive  multi-phase  flow  equations  formulated  by  Baer  and  Nunziato  are  based  on  the 
continuum  theory  of  mixtures,  where  both  the  solid  and  gas  phases  are  assumed  compressible 
and  in  thermodynamic  non-equilibrium.  An  important  difference  between  their  formulation 
from  previous  systems  for  two-pha.se  flows  is  in  how  the  closure  of  the  system  is  obtained. 
In  addition  to  the  usual  saturation  constraint  and  equations  of  state  for  e.-ch  phase,  the 
solid  volume  fraction  is  recognized  as  an  independent  degree  of  freedom,  and  its  evolution  is 
rlescribed  by  an  equation  consistent  with  the  second  law  of  thermodynamics  [1]. 

More  specifically,  let  the  subscript  a  denote  either  the  solid(a=s)  or  the  gns{a=g);  the 
Baer-Nunziato  system  consists  of  the  following  7  evolution  equations  [1,2]  : 

Conservation  of  mass: 

’O^i.'PaPa)  "b  ’o^i^aPa^'a)  —  Cg  (1) 

Conservation  of  momentum: 

^(<f>apr.Vn)  +  —{(l>a{p,.vl  +  Pa))  -  Pg^  =  m|  (2) 
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Conservation  of  energy: 

^(^aPo/?a)  +  +  PaK)  “  Pa~^^*  =  «!  (3) 

Compaction  equation  for  the  solid  volume  fraction: 

X  -fi+f  /4\ 

di  ^^•dx 

Saturation  constraint: 

+  0a  =  1  (5) 

Equations  of  state  for  each  phase: 

f'a  =  ^a{PaiPa)  •  (6) 

Associated  to  the  a-phase  is  tlie  velocity  material  density  pa,  pressure  po,  temperature 
To,  internal  energy  ta  and  volume  fraction  0a-  The  total  energy’  is  given  by  £„  =  €„  +  v^/2. 

For  simplicity  we  assume  that  both  phases  are  described  by  equations  of  state  of  ideal 
gas  type 

e  = 

“  Para 

c  ^7) 

rry 

with  c“,  Fa  constants.  Fa  is  the  Gruneisen  coefficient  and  it  is  related  to  the  7-gas  constant 
7a  hy  Fa  =  7a  -  1.  Representative  values  for  the  gas  and  the  solid  are  Fj  =  0.2  and  F,  =  -3. 

Next  we  discuss  the  phase  interaction  terms  cl,  ml  and  el.  First  of  all,  since  mass, 
momentum  and  energy  for  the  mixture  have  to  be  conserved,  the  phase  interaction  terms 
must  satisfy  the  constraints 


The  reaction  term  d  gives  the  rate  of  depletion  of  the  solid  due  to  surface  burning  and  for 
simplicity  we  assume  a  one-step  forward  reaction  with  Arrhenius  kinetics 


here  A  is  the  activation  energy,  T,  is  a  reference  temperature,  and  A'  is  the  pre-exponential 
factor.  All  are  assumed  to  be  constants.  T  is  the  mixture  temperature  and  it  is  given  by 

T  =  0,T,  -I-  <(>gTg. 
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The  second  term  fc  in  the  compaction  equation  represents  changes  in  volume  fraction 
due  to  pressure  differences,  and  it  is  given  by 


fc  =  ^(P.  -  P,  -  MM) , 
f*c 


(10) 


where  /<c  is  a  compaction  viscosity  coefficient  and  represents  intergranular  stress  due 

to  grain  contact. 

The  phase  interaction  terms  ml  and  cl  for  the  exchange  of  momentum  and  energy  are 
given  by 


mi 


t  _ 


-  M  + y)  + 


(11) 


The  terms  in  Eq.  11  involving  cl  and  fc  represent  exchange  in  momentum  and  energy  due  to 
chemical  reaction  and  compaction  respectively.  6(t?,  —  u^)  represents  changes  in  momentum 
due  to  drag,  and  h(Ts  —  Tg)  represents  changes  in  energy  due  to  convective  heat  transfer 
between  the  gas  and  the  solid. 

Next  we  discuss  the  characteristics  for  the  reactive  multi-phase  system.  All  the  algebraic 

details  can  be  found  in  [6,  7].  With  u  =  {psiVa,Psi4>a^Pg>Vg,Pg)^ ,  Eqns.  1-4  can  be  written 

in  matrix  form  as  .... 

du  .  ,  ^du 


Ao(«)^  +  A,(u)^  =  5(u), 


(12) 


where  i4o(u)  and  Ai(u)  have  the  block  structure 

Ao(tt)  = 

A,(«)  = 


/  4>3^fo{Us) 

/o(«.) 

0 

1 

0 

\  0 
/  <r>iV/l(l<,) 

-foM 
fiUh)  4 

h 

e(?i) 

(“fl)  / 

0 

Vs 

0 

\  0 

-Mug)-e{u) 

<t>9^fMg)  / 

(13) 


with  /o(«o),  /i(«a)  and  e(u)  given  by  (a  =  s,g): 

/o(«a)  =  {Pa,PaVa,PaEaV  =  S-,9 

/l(«a)  =  {PaVa,PaVl  +  Pa,{paEa-\-Pa)VaY,a  =  S,g 

e(it)  =  (0,-p5,-pgr,)^. 


(14) 
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The  characteristic  analysis  of  the  system  is  determined  by  the  study  oi  the  dgenvalues  Aj(u) 
and  their  associated  right  eigenvectors  rj(u),  j  =  I,...,?,  of  the  matrix  equation 

(~Aj(u)y4o(u)  +  /4,(M))r^(u)  =  0  ,  (15) 


and  they  arc  given  by: 
Eigenvalues 


Al  =  Vg  +  Cj,  Aj 

A5  Vgf  Ag 


I'*,  ''a 

+  c.,  A7 


Aa  = 


V*,  A4 

V,  -  c. 


—  Vg  Cg, 


Right  Eigenvectors 


ri 


=  ^0,0,0,0,l,^,cn 

= 

V  <p»i 


ra  =  (1,0,0,0,0,0,0)^ 

=  ^0,0,0,0, 

rs  =  (0,0, 0,0, 1,0,0)^ 

rg  =  ^l,^,cJ,0,0,0,oj 
rr  =  ^l,-^,c^0,0,0,0j  . 


(16) 


(17) 


Clearly  the  system  is  hyperbolic,  with  the  wave  speeds  corresponding  to  the  standard  acoustic 
and  particle  speeds  for  each  one  of  the  phases.  This  fact  is  important  both  from  the  physical 
and  computational  grounds.  In  fact,  several  of  the  older  models  proposed  for  multi-phase 
flows  exhibited  complex  characteristics  and  elliptic  regions  at  low  speed  regimes  [5,  13].  This 
triggers  catastrophic  instabilities  of  Hadamard  type  at  the  initial  deflagration  stage  and 
invalidates  the  results  of  numerical  calculations.  The  right  eigenvectors  of  the  system  are  also 
the  standard  acoustic  and  entropy  modes  for  each  phase,  with  the  exception  of  the  eigenvector 
rj  does  not  have  an  analog  in  one-phase  flow  and  corresponds  to  compaction  waves.  Although 
the  system  is  hyperbolic  it  is  not  strictly  hyperbolic  because  v,  is  a  double  eigenvalue,  and 
also  because  the  wave  speeds  associated  with  each  phase  change  independently  of  each  other, 
so  that  the  eigenvalues  of  the  solid  phase  can  coincide  with  any  of  those  for  the  gas  phase. 
However,  the  system  is  totally  hyperbolic:  it  has  a  complete  family  of  associated  right 
eigenvectors,  except  at  special  singular  points.  These  singular  resonant  points  are  realized 
when  one  of  the  acoustic  signals  for  the  gas  equals  the  solid  particle  speed:  u,  =  ±  Cg.  In 
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this  case  the  compaction  mode  degenerates  'nto  one  of  the  gas-acoustic  modes: 


lim  ra(u)  =  rJu) 
lim  rjlu)  =  r4(u) . 


(18) 


The  singular  resonant  points  Vg±Cg  =  i>,  are  also  known  as  choked  flow  points  [6,  7]  because 
at  the  pore  level  they  can  be  interpreted  as  choking  of  the  gas  flow  through  the  pore  space 
between  grains.  For  definiteness  consider  the  first  case  where  ri  and  rj  are  aligned.  The 
other  case  is  analogous.  In  this  case  u,  is  a  triple  eigenvalue  with  only  two  eigenvectors  ri 
and  ra.  In  fact,  there  is  now  a  2  x  2  Jordan  block  structure  coupling  the  acoustic  eigenvector 
ri  with  a  generalized  compaction  eigenvector  which  we  still  denote  by  rj: 


(-A/1o(m)  +  ^i(w))ri  =  0 
(-A/lo(u) -1- /li(ti))r2  =  -Ao{u)ri 
(-A/lo(tt)  + /li(«))r3  =  0, 


where  r2  is  given  explicitly  by 


r2 


'Ug  P,  -  Pg 
Pg^g 


PgCg 


(19) 


(20) 


The  algebraic  structure  displayed  in  Eq.  19  already  puts  in  evidence  the  resonant  character 
of  the  choked  flow  points  through  the  coupling  of  the  gas-acoustic  and  the  compaction  eigen¬ 
vectors.  However,  the  solid  entropy  mode  remains  decoupled  from  the  other  two.  Therefore 
near  choked  flow  conditions  there  is  strong  nonlinear  interaction  due  to  the  coupling  the 
gas-acoustics  and  compaction  modes:  it  is  near  these  resonant  points  that  the  simplified 
asymptotic  mo<lel  will  be  formulated  m  the  next  section. 


3  The  asymptotic  model  near  a  resonant  state 

The  full  derivation  of  the  asymptotic  model  near  resonant  points  is  given  in  [7].  The  in¬ 
terested  reader  can  study  the  <letails  of  the  length}'  derivation  there.  Here  we  just  want 
to  comment  on  the  physical  and  mathematical  basis  of  the  derivation  and  connections  of 
the  asymptotic  model  and  the  continuum  reactive  multi-phase  system.  The  purpose  of  the 
asymptotic  model  is  to  descriljo  the  nonlinear  resonant  interaction  of  the  gas-acoustics  and 
compaction  modes  at  the  convective  stage  of  DDT,  with  a  fast  moving  wave  satisfying  near 
choked  flow  and  ignition  temperature  conditions.  The  physical  mechanisms  singled  out  are 
the  resonance  of  the  ga.s-acoustic  and  compaction  modes,  the  burning  of  the  solid  explosive, 
and  the  convective  heat  transfer  from  the  hot  gas  products  to  the  solid  combustible.  The 
niathematical  derivation  of  the  model  is  asymptotic  in  nature  and  combines  the  methods  of 
nonlinear  geometrical  optics  with  large  activation  energy  asymi)totics  [9,  10,  11,  12].  There¬ 
fore  we  assume:  1 .  a  resonant  background  state  iiq  with  e®  =  e“  -|-  c®.  2.  near  ignition 
temperature  conditions  for  the  background  state.  3.  Large  activation  energy  for  the  reac- 
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tion,  and  4.  weak  chemistry  for  the  reaction.  This  last  requirement  is  due  to  the  small 
amplitude  nature  of  the  asymptotic  approximation.  The  reactive  multi-phase  equations  can 
be  cast  in  nondimensional  form  as  follows  (see  [7]  for  the  details  of  the  scaling  procedure): 

The  relevant  nondimensional  parameter  c  is  proportional  to  the  inverse  of  the  activation 
energy,  so  that  c  <<  1.  The  nondimensional  matrices  i4o(u)  and  i4i(u)  have  the  same  form 
as  in  Eqs.  12-13.  In  the  nondimensional  source  term  S(u,uo,e)  the  dominant  effect  is  due 
by  heat  convection,  followed  by  chemical  reaction  and  drag: 


where 


c5(u,uo,c)  =  c5‘(u)  +  c^[W(u,«o,c)5^(u)  -f  5*(u)]  -f  e^S^u) , 
5‘{u)  =  h(T,  -  r,).(0, 0,-1, 0,0, 0,1)^ 

5^(„)  =  -1, 

W(ti,Uo,<)  =  ,ro=l 

=  6(u,  -  Ug).(0,  -l,-t»„  0,0,1,  t>,)^ 


5^(«)  =  /c<(0,0,  -P,  +  /3„l,0,0,p,  -4*)^ 

/c(*0  ~  ^c^*0j(p*  Pa  ~  • 

In  the  nonlinear  geometrical  optics  approximation  we  assume  small  amplitude  high  frequency 
waves  around  the  background  resonant  point  «o  with  resonant  speed  A  =  t>°  =  With 

the  fast  variable  0  =  (.t  —  At)/€  the  ansaizoi  nonlinear  geometrical  optics  becomes 

=  Wo  +  «(«r(f)  +  +  •••)  +  (24) 


Here  u\{t)  represents  the  mean-field  effects  due  to  convective  heat  transfer,  (Ti{0,t)  is  the 
gas-acoustic  perturbation  amplitude,  and  <T2(0,  <)  is  the  compaction  perturbation  amplitude. 
The  resonant  character  of  the  approximation  is  already  evident  at  the  formal  asymptotic 
level,  where  the  order  compaction  term  resonates  with  the  order  e  gas-acoustic  term.  We 
also  omitted  for  simplicity  the  contributions  of  the  solid  entropy  mode  in  the  asymptotic 
approximation  because  from  the  algebraic  structure  of  the  system  at  resonant  points  given 
in  Eq.  19,  it  is  apparent  that  the  coupling  of  the  solid  entropy  mode  with  the  other  two  is 
going  to  weak,  and  only  through  the  source  term.  Nevertheless,  the  solid  entropy  mode  is 
considered  in  the  derivation  given  in  [7]. 

The  initial  data  is  a  perturbation  around  the  resonant  background  state  Wq 

w'(.r,0)  =  iio  +  t  ^w‘  -I-  <7®  r,^  -|-  eV”  r2  .  (2-5) 


where  the  initial  mean  field  perturbation  w*  at  order  c  is  given  by  u’  =  (pj,  pj,  , 

rj  is  given  by  Eq.  17  and  ri  by  Eq.  20.  Without  going  into  the  lengthy  details  of  the 
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derivation  given  in  [7],  the  resulting  asymptotic  system  for  the  resonant  interaction  of  the 
gas-acoustic  and  compaction  mode  is  given  by: 

/p<\  /  +  0(e») 

v;  +  evl  +  O(e^) 

Pi  Ps  +  ^  (p1  +  <r.A(7;  -  7f)M^)  +  O(e^) 

41  =  4>^,  +  e4\-t^24y,iej)/py,  +  Oie^)  ,  (26) 

pI  />° +  «(?]  + 

^9  ^  O)  /Pj  +  0{e^) 

<  p*  /  I  pS + ^  (pi  -  t^MT;  -  7?)/^; + (c;)v,((?,t))  o(c^) , 

where  the  gas-acoustic  amplitude  <Ti{d,t)  and  the  compaction  amplitude  <T2iO,t)  satisfy  the 
simplified  asymptotic  equations: 

Gas-acoustic  mode: 

gjr  +  55(('"-‘-<)<T,+ciYj--5j-  +  «i-5p-  (27) 


Compaction  mode: 


=  /?exp(a,<Ti  +/3i  +  fi2t)y 


Eqs.  27-28  have  been  written  in  a  coordinate  system  that  makes  <72  stationary.  The  coeffi¬ 
cients  in  Eqs.  27-28  are  given  explicitely  by 
Wave  speed  coefficients 


,c^/pi  pI\ 

^  V„'  ' 


430 


Source  term  coefficients 


oi  = 


= 


^2  = 
R  = 


3  >  0 

2<A« 


(30) 


From  Eq.  26  it  follows  that  an  increase  of  the  gas-acoustic  amplitude  <ti  corresponds  to 
the  increase  of  the  physical  variables  pg  and  pg.  Similarly,  an  increase  of  the  compaction 
cimplitude  aj  corresponds  to  a  decrease  of  the  volume  fraction  <f>a  of  the  solid,  i.e.  to  burning 
of  the  solid.  These  will  be  relevant  when  studying  the  numerical  solutions  of  the  asymptotic 
system  in  the  next  section. 

Of  all  the  parameters  in  Eqs.  29-30  the  most  relevants  in  our  analysis  are  bi  and 
Both  parameters  include  convective  heat  transfer  effects  from  the  background  state  Uq.  bi 
controls  the  linear  recesion  speed  from  the  burning  front  in  Eq.  27.  If  6i  >  0  then  the  there 
is  drift  away  from  the  burning  front  and  the  opposite  occurs  if  bi  <  0.  It  is  clear  from  Elq. 
29  that 

6i>0  iff  T;>T^,  (31) 

that  is,  if  at  the  background  state  the  product  gases  preheat  the  solid  by  heat  convection.  5i 
also  is  relevant  in  studying  the  resonant  behavior  that  the  asymptotic  system  inherits  from 
the  multi-phase  equations.  The  measure  of  how  near  the  flow  is  from  resonance  is  given 
by  the  relative  Mach  number  M  defined  by  M  =  (vs  —  Vg)fcg.  In  terms  of  the  asymptotic 
approximation  in  Eq.  26  is  given  by 


M  =  I  +  e(b\t  -  ci<7i  —  oi)/c°  -f  O(e^). 


(32) 


Hence,  if  a  burning  front  initially  has  M  <  1  but  6i  >  0  we  expect  that  it  will  accelerate 
and  go  through  the  resonant  state  M  =  1  in  finite  time.  In  the  next  section  we  will  specify 
the  initial  data  so  that  M  <\. 

The  other  relevant  parameter  is  /?2-  From  Eq.  28  it  follows  that  the  chemicla  reaction 
is  accelerated  if  >  0  and  inhibited  otherwise.  Therefore  ^2  is  an  important  parameter  to 
consider  in  the  creation  of  hot  spots.  From  E)q.  30  it  follows  that 

/?2>0  iff  (33) 

that  is,  in  order  to  enhance  the  reaction  the  gas  needs  to  be  hotter  than  the  solid  but  the 
temperatures  cannot  be  too  disparate,  with  a  bound  of  their  ratio  in  Eq.  33  given  by  the 
Gruneisen  coefficients.  With  F,  «  3.0  and  Fg  0.2  we  have  an  upper  bound  in  Eq.  33  of 
the  order  of  60. 
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4  Hot  spot  development  in  the  asymptotic  model 

In  this  section  we  discuss  the  qualitative  predictions  that  can  be  made  with  the  <isymptotic 
model.  The  discussion  is  based  on  the  paper  by  Embid  and  Majda  [8]  where  the  interested 
reader  is  referred  for  a  more  complete  discussion  of  the  cases  presented  here  as  well  as 
additional  cases  of  interest.  The  reader  can  find  also  in  [8]  a  discussion  of  the  numerical 
method  used  to  solve  the  asymptotic  system  as  well  as  interesting  analytical  solutions  of  the 
equations  for  the  non-reactive  case.  For  the  experimental  set-up  we  consider  the  situation 
were  a  fast  moving  burning  front  is  initially  near  resonance.  More  specifically,  we  assume 
at  the  start  a  fairly  stiff  situation  were  there  is  no  gas-acoustic  disturbances  produced,  so 
that  aiO,  0)  =  0,  and  the  compaction  mode  is  a  monotonic  decreasing  profile  as  depicted  in 
Fig.  1,  so  that  there  is  more  compaction  on  the  right.  The  initial  data  is  chosen  so  that  the 
wave  is  subsonic  (A/  <  1)  but  near  resonance.  From  Eqs.  29  and  32  it  is  clear  that  this  can 
be  achieved  with  «o  a  resonant  state  and  by  adjtisting  the  initial  mean  field  correction  so 
that  oi  >  0.  With  this  prescription  of  the  initial  data  we  consider  three  cases  of  interest.  In 
case  1  there  is  no  chemical  reaction.  Case  2  has  chemical  reaction  enhanced  with  a  choice 
of  02  >  0.  Finally  case  3  has  inhibited  reaction  with  02  <  0.  The  numerical  solution  of  the 
Eqs.  27-28  was  done  with  operator  splitting  and  a  higher  order  Godunov  scheme  for  the 
nonlinear  wave  equation.  For  the  details  the  reader  is  referred  to  [8]. 

Case  1:  72  =  0,  6i  =  5  (no  reaction).  In  this  case  it  is  clear  from  Eq.  28  that  the 
compaction  amplitude  o-j  does  not  change  in  time  and  it  remains  the  monotonic  decreasing 
profile  in  Fig.  1.  On  the  other  hand,  the  inhomogeneities  in  <72  act  as  a  source  term  for 
the  gas-acoustic  amplitude  in  Eq.  26.  Overlays  of  the  gas-acoustic  amplitude  in  time  are 
depicted  in  Fig.  2.  From  an  initial  uniform  zero  state  merges  a  wave  that  grows  initially  in 
amplitude.  Because  6i  >  0  tiie  wave  moves  to  the  left  of  the  burning  front.  From  Eq.  32  the 
wave  also  becomes  resonant  in  finite  time  and  develops  a  shock  wave.  Afterwards  the  wave 
amplitude  decays  in  time. 

Case  2:  72  =  1,  6i  =  5,  02  =  2  (enhanced  reaction).  Overlays  in  time  of  the  solution 
for  the  amplitudes  <ti  and  (Jj  are  depu'cd  in  Figs.  3(a)  and  3(b).  At  the  initial  stages  from 
/  =  0.0  to  t  =  2.0  the  behavior  of  (Tj  is  similar  to  case  1  with  a  shock  wave  moving  to  the  left 
of  the  burning  front.  .A  this  stage  shows  essentially  uniform  burning  throughout  but  with 
the  presence  of  small  disturbances  produced  by  the  left  moving  gas-acoustic  shock  wave. 
However,  by  time  t  =  2.5  the  feedback  ?nechanism  built  in  the  asymptotic  sy.stem  produces 
amplification  of  the  gas-aconstic  mode  and  t  he  formation  of  a  region  of  relative  compaction 
in  az-  The  resonant  feedback  mechanism  enhances  the  growth  of  this  gas-acoustic  hot-spot 
and  at  the  same  time  creates  a  zone  of  enhanced  burning  of  the  solid  ahead  of  the  region  of 
relative  compaction,  represented  by  the  spike  in  <T2  at  times  I  —  2.8  and  2.81.  At  this  stage 
the  resonant  feedback  between  both  modes  is  very  strong  and  induces  the  very  rapid  growth 
of  the  hot  spot.  Shortly  after,  by  time  /  =  2.816  and  at  location  .r  =  —14.9,  the  maximum 
measured  amplitude  for  a\  ia  about  3.3  x  10'*  while  the  amplitude  for  cxz  yields  values  of 
(machine)  infinity.  Clearly,  at  this  point  we  are  beyond  of  the  regime  of  validity  of  the 
asymptotic  approximation  and  possibly  other  physical  effects  not  incorporated  in  the  model, 
such  as  compression  of  the  solid  maj’  be  relevant.  We  point  out  that  the  behavior  described 
in  this  case  is  qualitatively  similar  to  the  situation  documented  in  [1]  for  the  combustion  of  a 
column  of  IIMX  with  95  %  initial  density.  The  numerical  solution  of  the  reactive-multiphase 
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equations  shows  the  birdth  and  growth  of  a  hot  spot  during  the  convective  stage  of  DDT 
(from  10-20  nsec,  and  this  hot  spot  moves  away  from  the  compaction  front  at  a  linear  speed, 
in  accordance  with  the  predictions  of  the  asymptotic  model.  We  also  remairk  that  in  [8] 
we  discuss  other  parameter  choices  were  the  hot  spot  grows  within  the  compaction  burning 
front.  This  situation  has  also  been  observed  in  the  multi-phase  flow  equations  (1,  2,  3]. 

Case  3:  /?  =  1,  6i  =  5,  =  —0.1  (inhibited  reaction).  Overlays  in  time  of  the  solution 

for  the  amplitudes  a\  and  crj  are  depicted  in  Figs.  4{a)  and  4(b).  In  this  case  we  notice  the 
strong  inhibiting  effect  that  a  negative  value  of  ^2  has  on  the  resonance  mechanism  and  the 
creation  of  hot  spots.  In  this  case  we  notice  that  the  gas-acoustic  solution  depicted  in  Fig. 
4(a)  does  not  develop  a  hot  spot.  In  fact,  the  solution  is  almost  identical  to  the  nonreactive 
case  depicted  in  Fig.  2.  Also  revealing  is  to  observe  the  behavior  of  the  compaction  amplitude 
in  Fig.  4(b).  Here  the  compaction  wave  goes  through  essentially  a  very  slow  and  uniform 
burning  (compare  with  Fig.  3(b)),  with  the  development  at  the  initial  stages  of  small  small 
disturbances  that  cannot  amplify  and  rapidly  die  out.  Situations  qualitatively  similar  to 
the  one  described  here  are  observed  in  stable  fast  deflagration  waves  that  do  not  transit  to 
detonation  [3]. 

5  Conclusions 

We  have  studied  a  state  of  the  art  continuum  mixture  theory  system  of  equations  for  reactive 
multi-phase  flow  utilized  in  the  study  of  DDT.  We  identified  interesting  singular  states  for 
the  flow  and  using  the  methods  of  nonlinear  geometrical  optics  and  high  activation  energy 
asymptotics,  derived  a  simplified  asymptotic  model  to  study  the  formation  and  growth  of  hot 
spots.  We  showed  how  by  chosing  different  values  of  selected  parameters  one  can  reproduce 
in  qualitative  fashion  different  scenarios  for  DDT  documented  in  the  literature.  By  design 
the  model  applies  only  on  the  small  amplitude  regime  and  provides  no  insight  on  the  later 
stages  of  DDT  process.  On  the  other  hand,  the  model  provides  interesting  parameter  regimes 
with  physical  significance  that  indicate  whether  or  not  hot  spots  will  develop  and  the  relevant 
physical  mechanisms  responsible  for  it. 
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Figure  Captions 


Fig.  1  Initial  burning  wave  profile  for  the  compaction  mode  a-i 

Fig.  2  Overlays  of  the  gas  acoustic  amplitude  ffi  for  the  nonreactive  wave  with  heat  con¬ 
vection.  Case  6i  =  5.  Snapshots  recordotl  from  <  =  0.0  to  <  =  10.0  at  time  intervals 
At  =  1.0. 

Fig.  3  Overlays  for  a  reactive  wave.  Case  hi  =  5,  /?2  =  2.  (a)  Amplitude  (Tj.  (b)  Amplitude 
<r2.  Snapshots  recorded  at  times  t  =  0.0,  0.5,  1.0,  1.5,  2.0,  2.5,  2.6,  2.7,  2.8,  and  2.81. 

Fig.  4  Overlays  for  a  reactive  wave.  Case  hi  =  5,  /32  =  —0.1.  (a)  Amplitude  <ri.  (b) 
Amplitude  <T2.  Snapshots  recorded  from  t  =  0.0  to  t  =  10.0  at  time  intervals  At  =  1.0. 
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THERMIONIC  EMISSION  FROM  HIGH-Tc  SUPERCONDUCTORS 
Richard  A.  Weiss 

U.  S.  krwy  Engineer  Waterways  Experinent  Station 
Vicksburg,  Mississippi  39180 

ABSTRACT.  The  superconducting  state  of  a  high-Te  naterial  with  T  <  T^  is 
described  by  a  coherent  spacetisM  state  in  which  the  electrons  of  a  Cooper  pair 
rotate  coherently  or  incoherently  in  the  azimithal  angle  space  and  time 
variables,  and  move  coherently  in  the  radial  space  and  time  variables.  In  this 
sense  the  electrons  of  a  Cooper  pair  are  localized  in  the  radial  space  and  time 
coordinates.  The  normal  state  of  a  high-Te  material  with  T  >  Te  occurs  «dien  the 
electrons  in  a  Cooper  pair  are  in  a  partially  coherent  radial  spacetime  state. 
The  equilibrium  equations  for  a  pair  of  Cooper  electrons  are  formulated  for  tne 
coherent  radial  spacetime  conditions  of  the  superconducting  state,  and  it  is 
found  that  for  a  weak  attractive  Coulomb  pairing  force  the  rotation  is  non- 
Keplerian  in  the  sense  that  the  angular  frequency  of  rotation  is  independent  of 
the  separation  distance  of  the  two  electrons  of  the  pair.  Applications  to  the 
theory  of  thermionic  emission  from  high-T^  materials  are  considered,  and  the 
thermionic  emission  current  is  calculated  for  the  normal  and  superconducting 
states.  The  thermionic  emission  current  for  the  normal  state  of  a  high-Te 
material  is  found  to  be  given  by  a  modified  Richardson- Dtishman  equation  with  T^ 
still  as  the  leading  temperature  dependent  term,  %fhile  the  thermoemission  from 
the  superconducting  state  is  given  by  a  totally  different  expression  which  does 
not  have  the  leading  quadratic  temperature  term  but  instead  can  have  leading 
temperature  dependent  terms  of  the  form  T“,  T^^^,  T  or  T®^*. 

1 .  INTRODUCTION .  The  discovery  of  high-Tc  superconductors  brings  the 
promise  of  the  development  of  a  new  technology  in  communications,  power 
generation  and  transmission,  and  mass  transportation.^'^  High-T^  superconductors 
for  instance  will  lead  to  stronger  electromagnets  for  research  and  industry.  The 
prospective  uses  are  limitless  if  the  high-Te  materials  can  be  found  that  have 
important  material  properties  such  as  ductility,  malleability,  strength  and  the 
ability  to  maintain  the  superconducting  state  in  the  presence  of  the  magnetic 
field  generated  by  the  current  flowing  in  the  high-Te  superconductor.  The 
materials  have  to  be  designed  to  exhibit  these  useful  engineering  properties  in 
addition  to  having  the  property  of  high-Te  superconductivity.  Trial  and  error 
design  of  such  substances  is  of  value,  but  considering  the  magnitude  of  the 
nuiid}er  of  inorganic  and  organic  compounds  that  need  to  be  investigated  it  is 
clear  that  the  trial  and  error  approach  should  be  supported  by  an  analytical 
method  for  the  design  of  high-Te  compounds  having  engineering  value. 

New  material  design  requires  an  understanding  of  the  chemical  and  physical 
processes  that  occur  in  a  material  to  produce  a  high-Te  superconducting  state. 
Unfortunately  the  underlying  mechanism  of  high-Te  superconductivity  is  not 
completely  understood  for  the  layered  oxide,  organic,  or  heavy  fermion  type  of 
superconductors.  This  paper  develops  a  theory  of  the  thermionic  emission  from 
high-Te  materials  which  can  possibly  lead  to  the  development  of  an  analytical 
understanding  of  the  properties/structure  relationship  for  these  materials. 
Specifically,  the  paper  investigates  the  motion  of  electrons  in  Cooper  pairs  that 
form  in  high-Te  superconductors,  and  calculates  the  various  forms  of  the  therm- 
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ionic  emission  current  fron  high-Tg  materials.  The  motion  of  electrons  in 
hi^-Tg  materials  are  subject  to  the  broken  symmetry  nature  of  spacetime  in  these 
materials,  and  the  electrical  properties  of  a  high-Te  material  will  depend  on  the 
type  of  broken  spacetime  symsietry  that  occurs  in  the  Cooper  electron  pairs.  The 
outline  of  this  paper  is  as  follows:  Section  2  considers  the  non-Keplerian  mo¬ 
tion  of  the  electrons  in  a  Cooper  pair  in  the  superconducting  state  of  a  hi^-Te 
material,  and  Section  3  develops  a  theory  of  the  thermionic  emission  from  the 
normal  and  superconducting  states  of  high-T^  materials. 

As  in  conventional  BCS  theory,  high-T^  superconductivity  is  generally  in¬ 
terpreted  as  being  due  to  a  broken  gauge  symmetry  associated  with  the  formation 
of  Cooper  electron  pairs  in  the  ground  state.*'*  The  electron-phonon  interaction 
probably  is  responsible  for  the  attractive  force  between  two  electrons  that  are 
located  in  the  vicinity  of  lattice  atoms.*'*  However,  these  concepts  have  not 
been  able  to  explain  the  large  values  of  the  measured  normalized  superconductivi¬ 
ty  energy  gaps  in  high-T,,  materials  which  have  the  following  range  of  values^'*-*' 

3  <  2A/(kTc)  <  8  (1) 

The  large  values  of  the  measured  normalized  superconducivity  energy  gaps  for 
high-Tc  materials  has  been  interpreted  as  being  due  to  a  strong  binding  of  the 
electrons  in  a  Cooper  pair.*-'*  In  fact  the  BCS  theory  of  weakly  interacting 
electrons  in  a  Cooper  pair  predicts  that  the  normalized  superconductivity  energy 
gap  is  given  by 

2A/(kT,)  -  3.52  (2) 

where  in  this  case  the  half-width  gap  A  refers  to  absolute  zero  temperature. 

Recently  a  broken  spacetime  symmetry  theory  of  high-Tc  materials  was 
developed  that  assumes  that  for  the  superconducting  state  the  space  and  time 
coordinates  of  the  electrons  in  a  Cooper  pair  change  by  a  coherent  rotation  in 
an  internal  space,  i.e.  space  a.  d  time  are  localized  for  the  electrons  in  Cooper 
pairs. This  concept  predicts  that  if  the  two  electrons  in  a  Cooper  pair  are 
weakly  interacting  the  normalized  superconductivity  energy  gap  is  nevertheless 
not  given  by  the  BCS  value  but  instead  has  the  value^®*^* 

2A/(kTJ  -  6A(3.52)(1  -  4A  (3) 

where  8^  -  internal  phase  angle  of  the  relative  acceleration  of  the  two  electrons 
in  a  Cooper  pair.  The  factor  6/jr  arises  from  the  condition  of  superconductivity 
in  a  broken  spacetime  symmetry  theory  of  Ohm's  law  and  the  assumption  of  free 
electrons  (weak  interaction)  which  require  that  the  internal  phase  angle  of  co¬ 
herent  time  has  the  value  Sj,  -  7r/6.^^  The  normal  state  of  a  high-T^  material  is 
interpreted  to  be  a  partially  coherent  spacetime  state. Laboratory  me  sure - 
ments  using  a  variety  of  experimental  techniques  on  various  types  of  hi  h-T^ 
materials  has  yielded  values  of  the  normalized  superconductivity  energy  g^p  that 
are  larger  than  the  BCS  value  of  3.52  as  indicated  by  equations  (1)  and  (2).^'®’ 
9-12,15,16  xhis  is  true  for  the  common  oxide  high-Tg  materials  and  for  the  alkali 
metal  doped  Cgo  fullerines .  These  large  values  of  the  superconductivity  energy 
gap  are  due  to  the  factor  6/jr  that  is  associated  with  the  coherent  radial  motion 
of  the  electrons  in  the  Cooper  pairs  of  a  weakly  interacting  high-T^  materi- 
al.i3-» 


within  Che  weakly  coupled  electron  gas  of  a  high~T(.  material  the  space  and 
time  coordinates  are  complex  numbers  in  an  internal  space  and  can  be  written  as 

17,18,20 


t^  exp(je^) 

(4) 

V  exp(j0^) 

(5) 

where  v  ■  x  ,  y  ,  z  for  cartesian  coordinates,  v  »  r  ,  ,  z  for  cylindrical  polar 

coordinates,  and  v  >■  p  ,  ^  ,  i|)  for  spherical  polar  coordinates.  Strictly  speaking 
a  time  coodinate  is  associated  with  each  space  coordinate  as  in  equations  (4)  and 
(5),  but  for  homogeneous  spacetime  it  follows  that 


t  = 
V 


t 


(6) 


for  all  V  .  Essentially  Che  broken  symmetry  of  spacetime  can  be  deduced  from  a 
relativistic  trace  equation  for  matter  and  energy. The  differentials  of 
the  time  and  space  coordinates  can  be  obtained  from  equations  (4)  and  (5)  to  be 

17,18,20 

dt^  =  sec  dt^  expCj*^)  =  esc  B^^ 
dv  =  sec  6  dv  exp(i$  )  =  esc  B  vdB  exp(j<l>  )  (8) 

where 


tan  B^.  =  t  9e^/3t 

tt  V  t  V 

(9) 

tan  B  =  v30  /9v 

VV  V 

(10) 

V  V  V 

=  0^  +  B,., 

(11) 

t  t  tt 

<l>  =  0  +  B 

V  V  VV 

(12) 

Then  the  magnitudes  of  the  differentials  of  space  and  time  are  given  by^^'^®’^° 
Idtyl  =  sec  B^,.  dt^  =  CSC  B^^  t^dB^  (13) 

Idvl  =  sec  6  dv  =  esc  6  vd6  (14) 

I  I  '^VV  VV  V 

17  18  20 

and  the  measured  values  of  the  time  and  space  coordinates  are  given  by  »  ’ 

t  =  t  cos  6^  (15) 

vm  V  t 

V  =  V  cos  6  (16) 

m  V 

where  v  =  x,y,z;  r,4),zorp,<ti,ti'. 
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The  velocity  and  raomentuB  components  of  a  particle  moving  in  broken  sym¬ 
metry  spacetime  must  also  be  written  as  complex  numbers  as  follows^^*^* 

V  «  V  exp(je  )  “  dv/dt  (17) 

y  y  VV  V 

p  ■  p  exp(je  )  =  mv  (18) 

where  v  -  x  ,  y  ,  z  .  The  momentum  magnitudes  can  in  general  be  written  in  four 
equivalent  ways  that  can  be  used  to  pass  to  the  four  possible  limiting  symmetry 
states  of  spacetime  namely,  incoherent  space  and  incoherent  time,  coherent  space 
and  Incoherent  time,  incoherent  space  and  coherent  time,  and  coherent  space  and 
coherent  time^^»^®*^° 

p  =  mv  =  m  sec  8  cos  dv/dt  (19) 

‘^V  V  VV  tt  V 


■  m 

CSC 

^v 

cos 

V  d6 

«  m 

sec 

®vv 

sin 

t-' 

V 

=  m 

CSC 

Bvv 

sin 

where  v  =  x  y  z  .  The  internal  phase  angles  of  the  velocity  and  momentum  com¬ 
ponents  are  written  as^’»^®»^° 

6  =6  =6+6  -  0^  -  eL  (23) 

VV  pv  V  VV  t  tt 

The  differentials  of  the  complex  number  components  of  the  single  particle  momen¬ 
tum  can  be  written  in  two  equivalent  ways^^’^®»^° 

dp  =  sec  6  dp  exp[j(6  +6  )1  (24) 

*^v  pvpv  ^v  pv  pvpv  ^ 

=  CSC  6  p  d0  exp[j(6  +6  )]  (25) 

pvpv  ‘^v  pv  pv  pvpv 


where 


tan  8  =  p  96  /9p 

pvpv  '^v  pv  ^v 

tan  B  =  V  96  /9v 

VVVV  V  VV  V 


and  where 


6  “  B 

pvpv  VVVV 


For  high-T^  materials  the  physically  interesting  spacetime  states  are  the  par¬ 
tial  spacetime  coherence  case  of  nearly  incoherent  space  and  nearly  incoherent 
time  which  described  the  normal  state,  and  the  case  of  coherent  space  and  coher¬ 
ent  time  which  describes  the  superconducting  state.  Ordinary  metallic  conduc¬ 
tors  correspond  to  incoherent  space  and  incoherent  time. 
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The  angitlar  speed  of  a  body  moving 
broken  internal  symmetries  is  written  as 

17,H,20 


in  the  x,y  plane  in  spacetime  with 
a  complex  number  in  internal  space  as 


u  ■  (1)  exp(j0^)  *  di/dt^ 

where 


0)  “  sec 

cos 

3^ 

‘^tt 

d4)/dt 

4» 

*  CSC 

cos 

3* 

'^tt 

* 

=  sec 

^4,4, 

sin 

3* 

•^tt 

t-^  d4./d0* 

*  CSC 

^♦4> 

sin 

3**’ 

^tt 

6=0, 

6*  - 

-  3* 

U)  ^ 

>4' 

t 

^tt 

where 


(29) 

(30) 

(31) 

(32) 

(33) 

(34) 


tan 


4)36  .  / 3<^ 
4> 


tan 


4>  t  <1 


(35) 

(36) 


and  where  t^  is  the  time  associated  with  the  azimuthal  angle  . 

The  linear  acceleration  of  a  particle  in  broken  symmetry  spacetime  is 
written 


a  “a  exp(16  )  =  dv  /dt 
V  V  av  V  V 


2  -2 
d  v/dt 

V 


(37) 


where  v  =  x,y,z  ,  rorp  .  The  case  v  *=  r  corresponds  to  the  linear  radial 
acceleration  in  the  x,y  plane,  so  that 


a  =  a  exp(j0  )  =  dv  /dt  =  d^r/dt^ 
r  r  ar  r  r  r 


(38) 


The  magnitude  and  internal  phase  angle  of  the  linear  radial  acceleration  can  be 
written  in  a  number  of  different  ways  that  are  appropriate  for  the  various 
states  of  coherence  and  incoherence  of  the  space  and  time  variations.^®  For 
the  case  of  nearly  incoherent  space  (6rr  nearly  incoherent  time 

(6?t  0)  the  magnitude  of  the  linear  radial  acceleration  is  written  as^® 

a  =  cos  8^  sec  3  d/dt(cos  3^  sec  3  dr/dt)  (39) 


e 

ar 


e 

r 


+  3 


rr 


+  3 

vrvr 


^tt) 


(40) 
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(41) 


where  Stt  ^rvr  a*'®  given  by  equations  (9)  and  (27)  as 


tan  ®tt  “ 


tan  3  *  V  30  /3v 

vrvr  r  vr  r 


where  t^  is  the  time  associated  with  the  radial  coordinate  motion. 

The  magnitude  of  the  linear  radial  acceleration  term  that  is  appropriate 
to  describe  nearly  coherent  space  (3rr  ii/2)  and  nearly  coherent  time 
(B^t  '''  '"/2)  is  given  by  the  following  expressions^® 

a  -  sin  3^,  (C^^  +  (43) 

r  tt  rt  rt  r  ■' 


=  sin  3^  CSC  3  C  ^/t 
tt  vrvr  rt  r 


where 


C  ^  =  Sin  3^^  CSC  3  r/t  dS  /d0f[d0  /d0^  -  1  +  d/d0^(B  -  3^,)]  (44) 

rt  tt  rr  r  r  t  r  t  t  rr  tt 

Drt  =  d/d0^(sin  3^^.  esc  3^^  r/t^  d0^/d0^)  (45) 

tan  3„„,^  =  /D  (46) 

vrvr  rt  rt 

CSC  3  =  (C^^  +  ^  (47) 

vrvr  '  rt  rf^  rt  '  ' 

and  where  the  internal  phase  angle  of  the  radial  acceleration  is 

6  -  2^9^  +  3^J  (48) 

ar  r  rr  vrvr  t  tt 


For  an  attractive  force  and  a  negative  value  of  the  linear  radial  acceleration 


it  is  convenient  to  write^® 


a  =  a'  exp(i0  ) 
r  r  ar 


where 


6  =6  +  TT 

ar  ar 


=  0+3  +3 

r  rr  vrvr 


2(0^  +  6^^)  +  'T 


For  a  positive  acceleration 
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+ 

a  ■  a 
r  r 


e'*’  -  e'  -  0  +  B  +  b'*'  -  2(0^  +  B*^  ) 

ar  ar  r  rr  vrvr  ^  t  tt' 


B  -  B  +  IT 

vrvr  vrvr 


where  Byrvr  is  given  by  equation  (47). 


The  general  conditions  for  nearly  incoherent  spacetime  are^ 


6  0 
vv 


Vi 

0^  'V  constant 


6^  'V  constant 


where  v  and  t^  are  variables.  The  differentials  of  the  spacetime  coordinates 
for  this  special  case  are  obtained  from  equations  (7)  and  (8)  as 

dtj  =  exp(j6^^)  (57) 

dv^  =  dv  exp(je^)  (58) 

For  this  special  case  of  nearly  incoherent  spacetime  the  momentum  equations 
(19)  and  (23)  give 


p^  =  mv^  =  mdv/dt 


6^  =  9^  -  =  constant  B^  =0 

pv  V  t  pvpv 


and  equation  (24)  becomes 


dpj  =  dp^  exp[j(0j  -  e^^)] 


where  v  =  x,y,z,rorp  for  the  linear  momenta.  For  rotational  motion  in  near¬ 
ly  incoherent  spacetime  v  =  <})  and  equations  (55)  and  (56)  give 


''  constant 


<P<I> 


9  'V  constant 


and  equations  (30)  and  (34)  give 
0)^  =  d(()/dt. 


1  1  <i)  i 

9  =9,-0^  =  constant 

w  <J)  t 
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For  nearly  incoherent  spacetime  in  the  radial  direction  v  -  r  .  ^  • 

'V'  0  and  ^  0  so  that  equations  (39)  and  (40)  give 


a  «  d  r/dt 
r  r 

9^  “  0^  -  29^^  “  constant 

ar  r  t 


(66) 

(67) 


For  exactly  incoherent  spacetime  all  internal  phase  angles  of  the  space  and 
time  coordinates  have  zero  values,  i.e.,  0^  =  0  and  0^^  =  0  . 


For  coherent  spacetime 


K.  - 


t  «  constant 

V 


2  0 


6  =  ir/2 

vv 


V  =  V  =  constant 


(68) 

(69) 


where  0^  and  0y  are  now  variables.  The  differentials  of  the  time  and  space 
coordinates  are  obtained  from  equations  (7)  and  (8)  as 


dt*^  =  jt‘^d0^ 
dv^  =  jv‘^d0 


(70) 

(71) 


For  coherent  spacetime  equations  (22)  and  (23)  give 


v*^  =  v^/t^  d0  /d0^ 

V  V  t 


C  V 

e  =0  -  0 

vv  V  t 


(72) 

(73) 


For  the  radial  coordinate  these  equations  become 


6^  =  Tr/2 

tt 


t  =  t  =  constant 
r  r 


dt'^  =  jt‘^d0^ 
r  r  t 

v^  =  r^/t^  d0  /d0*^ 
r  r  r  t 


c  r 

0  =0  -  0 

vr  r  t 


8  =  •ir/2 

rr 


r  =  r  =  constant 


dr^^  =  jr'^dO 


(74) 

(75) 

(76) 

(77) 

(78) 


The  case  of  rotational  motion  in  coherent  spacetime  is  described  by 


(79) 
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constant 


(80) 


^  ^  ■  constant 


-  ji^de 


and  equations  (33)  and  (34)  give 
<P  9  t 

=  e  -  0? 

0)  9  t 


where  6^  and  6^  are  now  variables. 


The  complex  number  linear  radial  acceleration  for  a  particle  moving  in 
coherent  space  and  coherent  time  under  the  influence  of  an  attractive  force  is 
obtained  from  equations  (38)  and  (43)  through  (48)  to  be^® 

=  a'  e*p(J6^^)  -  E'/if  -  JF^^)  (84) 


r  r  rt  rt  ^ 


c  ,2,1/2 


6^^  =  6  +  6^^  -  26^  -  n/2 

ar  r  vrvr  t 


C^.  =  T^/t^  E®. 
rt  r  rt 


rt  r  rt 


=  de  /d0'(de  /de^  -  i) 
rt  r  t  r  t 


=  d^0  /d0^^ 
rt  r  t 


F*^  >  0 
rt 


tan  /F^. 

vrvr  rt  rt 


B  =  -  x/2  +  6  ^ 
vrvr  rt 


tan  6  =  F%/|E%| 

rt  rt  '  rt' 


Combining  equations  (86)  and  (91)  gives  the  internal  phase  angle  of  the  accel¬ 
eration  of  a  particle  in  coherent  space  and  coherent  time  under  the  influence 
of  an  attractive  force  as 


0*^  =0  +  6  ^  -  20^  -  TT 

ar  r  rt  t 


which  has  values  in  the  neighborhood  of  -  ir  .  For  an  attractive  force  it  is 
convenient  to  write  the  acceleration  as^® 


— c  c  *  c  * 

a  =  a  expv16  ) 
r  r  ar 
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where 


r  r  r  rt  rt  ‘ 


+11 

ar  ar 


(95) 

(96) 


0  +  -  20^  +  it/2 

r  vrvr  t 


=  0  +  6  ^  -  20^ 
r  rt  t 


where  0^^  is  a  small  number. 


For  repulsive  forces  the  acceleration  of  a  particle  in  coherent  spacetime 
is  given  by 


5*^+  = 

r  r  rt  -*  rt^ 

a*^'*’  =  r^/t‘^^[(E‘^'*’)^  +  (F*^*^)^]^^^ 
r  r  '  rt  rt*^  ‘ 

0‘^‘'’  =  0  +  B*^'*’  -  20^^  -  ir/2 

ar  r  vrvr  t 


with 


jC+ 

^rt 


rt 

c 


F^+  .  .  F 

rt  rt 

tan  6'='"  -  E't/F't 

vrvr  rt  rt 

B^^"^  =  b'^  +  it 

vrvr  vrvr 


E^"!^  ^  0 
rt 

F^'l  <  0 

rt 

tan  6  = 

rt  '  rt'  rt 


(97) 

(98) 

(99) 

(100) 
(101) 
(102) 
(103) 


=  ir/2  +  6 


and 


rt 


C+  C  *  y 

0  =  0  =  0  +  6  -  20 
ar  ar  r  rt  t 


(104) 


so  that  0|^  is  a  small  angle.  Finally  if  a^  describes  a  negative  acceleration 

(105) 


_c+  _c 

a  =  -  a 
r  r 


Note  the  different  values  of  Bvrvr  ^vrvr  equations  (91)  and  (103)  for 
attractive  and  repulsive  forces. 


For  the  special  case  of  a  solution  of  the  form 


20 
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(106) 


0  -  a  0"  +  8 

r  r  t  r 


it  follows  chat  for  coherent  motion  in  the  radial  coordinate 


-  0 

rt 


f"*  -  0 

rt 


(107) 


so  that  equations  (91),  (92),  (102),  (103)  and  (104)  give  for  this  case 


6  -  -  ii/2 

vrvr 


0*^  =  =  0  -  26^^ 
ar  ar  r  t 


6“='^  -  "/2 
vrvr 


6^t-0 


(108) 

(109) 


and  therefore  for  this  special  case  equations  (24)  and  (25)  give 


dp*^  =  jp^d0 
•^r  -"^r  pr 

dv*^  =  iv^dO 
r  ■'  r  vr 

(110) 

c+  c 

dp  =  jp  d6 
'^r  r  pr 

,— c+  ._c 

dv  =  iv  d6 

r  *’  r  vr 

(111) 

where  for  coherent  spacetime 

P,  =  Prc  (112) 

with  p^^  «  constant.  In  the  general  case  of 

^  <“3) 


where  v=x,y,z,rorp  ,  it  follows  that  for  coherent  spacetime  and  equations 
(24)  and  (25) 


dp*^  =  jP^dO 
*^V  V  pv 

dp*^'*'  =  jp^^dO 
*^V  pv 


where 


dv*"  =  jv^dO 
dv^"*"  »  jv*^d8 

V  V  w 


(114) 

(115) 


_c 

Pv 


p  exp(i0  ) 
‘^vc  pv 


(116) 


These  expressions  are  used  in  Section  3  to  evaluate  the  momentum  space  integrals 
that  describe  thermionic  emission  from  the  superconducting  state  of  high-Tc  ma¬ 
terials. 

2.  NON-KEPLERIAN  MOTION  OF  ELECTRONS  IN  COOPER  PAIRS  OF  HIGH-Tr  SUPER- 
CONDUCTORS.  This  section  describes  the  dynamical  behavior  of  electrons  in  Cooper 
pairs  which  occur  in  the  superconducting  state  of  a  high-Tc  material .  A  Cooper  pair 
is  a  weakly  bound  system  of  two  electrons  which  are  held  together  by  a  weak  at¬ 
tractive  force  which  is  mediated  by  the  vibrations  (phonons)  of  the  crystalline 
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lattice  of  the  hlgh-Tc  superconductor.®"®  The  classical  description  of  this 

bound  system  is  given  by  the  equation  of  motion  of  two  electrons  orbiting  their 
center  of  mass 


Mg(a^  -  rw  ) 


(117) 


where  Pg  =  me/2  =  reduced  mass  of  the  electron,  mg  “  mass  of  the  electron, 
r  -  2a  =  relative  distance  between  the  two  electrons,  a  ■=  distance  from  center 
as  mass  to  either  electron,  and  where  the  linear  radial  acceleration  term  a^  is 
given  by 


a^  =  d  r/dt 

and  the  angular  frequency  is  given  by 
01  =  d<|)/dt 


(118) 


(119) 


where  (ji  =  azimuthal  angle  of  an  electron  in  it*s  orbit.  Equation  (117)  simpli¬ 
fies  for  the  case  of  a  circular  orbit  with  r  =  2a  =  constant  so  that  equation 
(117)  becomes 


0)  =  -  F  /(rp  )  =  -  F  /(am  ) 

1  e  i  e 


(120) 


where  F.  <  0  for  an  attractive  force,  and  a  =  radius  of  orbit.  Equation  (120) 
shows  that  the  orbital  frequency  varies  inversely  with  the  orbital  radius  even 
if  the  attractive  interaction  force  F^  were  independent  of  the  relative  dis¬ 
tance  between  the  electrons.  In  fact  the  pairing  force  between  the  two  elec¬ 
trons  of  a  Cooper  pair  is  an  attractive  inverse  square  law  given  by^“® 

F^  =  -  b/r^  (121) 

where  b  >  0  ,  so  that  equation  (120)  becomes 

0)^  =  b/(p  r^)  =  b/(4m  a^)  (122) 

6  0 

which  is  essentially  Kepler's  law  of  central  field  motion. This  section  con¬ 
siders  only  attractive  forces. 

In  the  normal  and  superconducting  states  of  a  high-Tg  material  the  space 
and  time  coordinates  of  the  electrons  in  Cooper  pairs  exhibit  broken  internal 
symmetries  and  must  be  written  in  the  fora  of  equations  (4)  and  (5).  Therefore 
for  a  high-Tg  material  the  equation  of  motion  of  the  electrons  in  a  Cooper  pair 
are  written  as 

p^(5^  -  ra^)  =  F.  (123) 

where  aj-  =  complex  number  linear  radial  acceleration  given  by  equations  (38) 
through  (48) ,  ai  =  complex  number  angular  speed  given  by  equations  (29)  through 
(34) ,  and  where  r  =  complex  number  relative  distance  between  the  two  electrons 
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and  4  ~  co«plex  nuaber  azlauthal  angle  of  the  electrons  %ihich  are  written  as 
r  -  r  exp(jep  ♦  -  ♦  exp(je^)  (124) 

The  associated  complex  number  time  coordinates  are  given  by 

t^  -  t^  exp(j0')  t^  -  t^  exp(jej)  (125) 

The  complex  number  electron-electron  interaction  force  is  written  as 

exp(j0p^)  (126) 


For  an  attractive  inverse  square  force 
F^  »  -  b/r^ 

-  b/r^  =  -  26^  +  0^  (128) 


where  b  is  taken  to  be 

b  =  ge^/(4iT)  (129) 

where  g  *  complex  number  electron-phonon  interaction  constant,  and  e  =  electron 
charge.  These  complex  number  parameters  are  written  as 

b  »  b  exp(j0j^)  g  =  g  exp(j0g)  (130) 

so  that  equation  (129)  can  be  written  as 

b  =  ge^/(4Tr)  0^  =  9^  (131) 

where  b  >  0  and  g  >  0  . 

The  total  angular  momentum  of  the  two  electrons  in  a  Cooper  pair  is  a  complex 
number  in  internal  space  for  spacetime  with  broken  internal  symmetries  and  is 
written  as 

exp(jej^^)  =  (132) 


where  f  =  complex  number  relative  radial  distance  between  the  two  electrons,  and 
w  =  complex  number  angular  speed  which  is  given  by  equations  (29)  through  (36). 
The  law  of  the  conservation  of  angular  momentum  for  this  case  is  written  as^^ 


L  =  u  r  0)  =  constant 
z  e 


L  =  y  r  0)  =  constant 
z  e 


0,  =20  +0  =  constant 

Lz  r  w 


(133) 

(13A) 

(135) 
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where  u  Is  given  by  equations  (30)  through  (33)  and  6^,  is  given  by  equation 
(34).  Combining  conditions  (34)  and  (135)  gives 

0  -  20^  +  0  +  6  “  ~  ■  constant  (136) 

Lz  r  <p  t  tt 

where  is  given  by  equation  (36) , 


A  full  solution  to  equation  (123)  requires  that  the  real  and  imaginary 
parts  of  this  equation  be  obtained  and  solved  jointly.  This  procedure  leads  to 
complicated  equations  which  are  not  easily  solved.  A  simpler,  but  approximate, 
procedure  is  to  assume  that  the  internal  phase  angles  of  each  term  in  equation 
(123)  are  equal.  This  gives  the  following  two  equations  for  the  case  where  the 
linear  radial  acceleration  term  is  positive 

M^(a^  -  m^r)  =  (137) 
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where 
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(138) 


(139) 


and  the  following  two  equations  for  the  case  when  the  linear  radial  acceleration 
term  is  negative 


p  (a'  -  oj^r) 
e  r 
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(140) 


0'  =6  +20 
ar  r  10 


(141) 


where  for  this  case 


6'  =  6  +  Ti 

ar  ar 


(142) 


In  both  cases  0ai-  =  =  small  numbers,  and  is  given  by  equation  (43)  while 
(D  is  given  by  equation  (30)  for  the  conditions  of  interest  in  this  paper.  Com¬ 
bining  equations  (53)  and  (138)  gives  for  a  positive  linear  acceleration  term 
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(143) 


=  0  +20 
r  0) 


where 

=6  +11  (144) 

vrvr  vrvr 


For  a  negative  linear  acceleration  term  equations  (51)  and  (141)  give 


0^.  =  0  +  3  +3  -  2(0^  +  3^,)  +  IT 

Fi  r  rr  vrvr  t  tt 


(145) 


=  0  +  20 


BH 


Equations  (143)  through  (145)  are  valid  for  the  case  of  nearly  coherent  space 
and  tlae  In  the  radial  direction.  Cosiblnlng  equations  (135)  and  (136)  or  (141) 
gives 


6-.  -  26-  -  30 

Fi  Lz  r 


(146) 


(147) 

(148) 


The  quantity  6^^  >  6^^  »  6pj^  enters  into  the  calculation  of  the  normalized  super¬ 
conductivity  energy  gap  for  high-T(.  superconductors.^** 

For  an  attractive  inverse  square  law  described  by  equation  (127)  it  fol¬ 
lows  from  equations  (51),  (128)  and  (141)  or  equivalently  from  equations  (53), 
(54),  (128)  and  (138)  that 


36+6  +6  -  2(0f  +  6^^)  +  It  =  e, 

r  rr  vrvr  '  t  tt'  b 

36  +  26  =  6. 

rub 


(149) 

(150) 


Equation  (149)  can  also  be  written  as 

36+6  +6'*’  -  2(6^  +  6^^)  *  6. 

r  rr  vrvr  t  tt  b 


(151) 


Equations  (135),  (146)  and  (150)  yield 


6  =  26^  - 
r  Lz  b 

6  »  26,  -  36, 

u  b  Lz 


36.  -  46, 
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(152) 


(153) 


where  0^,  is  given  by  equation  (34).  Therefore  within  the  approximations  given 
in  eqoations  (138)  and  (141)  it  follows  that  Bj.  ,  and  6^  are  constants. 
Equations  (34),  (151)  and  (153)  show  tiiat 


2(0  +  6  -0'^-6'*')  =  6  +6  -  2(6^  +  6^-)  + 
*  ‘^(bd)  t  tt  rr  vrvr  t  tt 


(153A) 


=  6+6'*'  -  2(e!^  +  6fj 

rr  vrvr  t  tt 


=  2(26^  -  36^^) 

=  constant 

which  is  an  approximate  equation  relating  the  radial  and  azimuthal  internal 
phase  angles  of  space  and  time. 

The  case  where  the  radial  acceleration  is  totally  coherent  in  space  and 
time  is  described  by  the  conditions 
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11/2 
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For  this  case  equations  (143)  and  (145)  become 

e_  -  0  +  -  2ef  +  11/2 

Fi  r  vrvr  t 
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where  is  given  by  equation  (91)  so  that 


-  0  -  20"  +  6  , 
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or 

-  «rt  -  26^  «57) 

■  2(20.  -  30,  )  ■  constant 

D  LiZ 

where  0^  -  constant  given  by  equation  (153).  Equation  (156)  is  seen  to  agree 
with  equations  (96)  and  (104).  Combining  equations  (146)  and  (156)  gives 

0,  -  20  +6  ^/2  -  (158) 

L2  r  rt  t 


which  is  valid  for  the  special  case  of  coherent  space  and  time  in  the  radial 
direction.  For  a  central  force  equations  (128)  and  (156),  or  equivalently  equa¬ 
tion  (149)  gives 

0,  =  36  +  6  ,  -  20’^  (159) 

b  r  rt  t 


where  in  these  equations  6^  =  constant  given  by  equation  (152). 
and  (159)  can  be  used  to  determine  6^  as 


Equations  (158) 


0^  =  30,  -  20,  +  6  ^/2 
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where  6^.^.  is  given  by  (88),  (89)  and  (92),  so  that  0^  is  also  a  constant. 

The  following  conditions  are  valid  when  the  space  and  time  coordinates 
are  coherent  in  the  radial  direction  but  the  angular  space  and  time  coordinates 
are  nearly  incoherent 


6  ^  = 

11/2 

(161) 
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tt 
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(162) 
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(163) 
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where  6^  and  are  constants  which  would  have  zero  values  for  the  case  of 
exact  Incoherence  of  the  azimuthal  space  and  time  coordinates.  Combining  equa¬ 
tions  (30),  (34)  and  (162)  gives 


u)  >  d^/dt 

0  -  ej  - 

0)  ^  t 


(164) 

(165) 


Combining  equations  (7),  (8),  (11),  (12),  (161)  and  (162)  gives 


df  =  jfd0 


(166) 


dt  =  jt  de^ 
r  r  t 


(167) 


d^  =  d<<)  exp(j8  ) 


(168) 


dt  =  dt  exp(je^^) 
9  <P  t 


(169) 


Then  combining  equations  (157)  and  (165)  gives 

> 


(170) 


Introducing  equation  (160)  into  equation  (170)  gives 

«rt  -  -  ef ) 


(171) 


which  can  be  rewritten  as 


20  -  30^  “  el  - 

b  Lz  <t>  t 


(172) 


Equations  (152),  (156),  (160)  and  (172)  give 


0^  =  0  -  20  +  6  , 
Fi  r  t  rt 


(173) 


=  0r  +  2(20^  -  38^^) 


=  0^  +  2(0;  -  0^  ) 


=  30,  -  40_ 
b  Lz 


where  0j.  is  given  by  equation  (152).  If  the  constants  b  and  are  related  in 
the  following  manner 
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or  equivalently  as 

2  3 

z 


20.  -  30, 
b  Lz 


(175) 


where  f  is  a  real  number  constant,  then  it  follows  from  equations  (153),  (165), 
(172)  and  (175)  that 
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(176) 


while  equations  (160)  and  (173)  give  for  this  special  case 


«rt  ■  “t 
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(177) 

(178) 


which  is  valid  for  the  general  case  6  4  0  . 


Consider  now  the  possibility  of  the  case  where  the  space  and  time  coordi~ 
nates  are  coherent  in  both  the  radial  and  the  azimuthal  directions  which  is  de¬ 
scribed  by 


6  =  Tr/2 

rr 


■  '^2 


=  ti/2  (179) 


where  now  6^.  ,  0^.  ,  6^  and  6^  are  variables.  For  this  case  equations  (143)  and 
(144)  become 


-  20^  +  ir/2  =  2(0  .  -  0^) 
vrvr  t  (t)  t 


(180) 


and  then  using  equation  (91)  gives 


^t  -  26;  =  2(6,  -  ep 


(181) 


which  is  similar  in  form  to  equation  (170)  except  that  the  right  hand  side  of 
equation  (170)  is  a  constant  while  the  right  hand  side  of  equation  (181)  is  a 
variable  because  for  this  case  equations  (33)  and  (34)  give 


u)^  =  d0  /d0j 

({.(ft 


(182) 


0=6,- 
w  cp  t 


(183) 


which  are  variables.  But  within  the  approximation  used  in  equations  (157)  and 
(160)  it  follows  that  for  incoherent  spacetime  in  the  azimuthal  direction 


6  20’^  =  2(20.  -  30,  ) 

rt  t  b  Lz 


(184) 


which  is  a  constant, 
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For  the  case  where  space  and  time  are  coherent  in  the  radial  direction  it 
is  often  expedient  to  assume  a  linear  relationship  between  6^  and  sE  of  the 
form*® 


0  -  o  +  6 

r  r  t  r 


Or  <  1 


(185) 


where  the  coefficient  o^.  is  determined  by  the  nature  of  the  pairing  force  which 
is  related  to  the  atomic  structure  of  the  crystal  lattice  of  the  high-Tc  materi¬ 
al.  Combining  equations  (88),  (89)  and  (185)  gives 


E\  =  o  (o  -  1)  F\ 

rt  r  r  rt 

while  equations  (90)  and  (91)  give 
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E  =  -  tt/2 
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(186) 


(187) 


and  equations  (96)  and  (104)  become 

0*^"^  =  0C'  =  0  _  20^^ 

ar  ar  r  t 


(188) 


Combining  equations  (157)  and  (187)  gives  for  this  special  case  of  coherence 
of  space  and  time  in  the  radial  direction 


6  =  -  e: 

u  t 


(189) 


while  equation  (156)  becomes 

-  26' 

=  0  +20 
r  0) 

=  20,  -  30 
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where  equation  (158)  gives 

0,  =  20  -  0^ 

Lz  r  t 


(191) 


For  an  inverse  square  electron-electron  pairing  force  equations  (159)  and  (187) 
give 

0,  =  30  -  20^  (192) 

b  r  t 

Equations  (185)  and  (192)  give  for  an  inverse  square  pairing  force 


a  =  d0  /d0’^  =  2/3 
r  r  t 


(193) 


so  that  equation  (186)  gives  for  this  case 
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(194) 


-  1)  -  -  2/9 


Combining  equations  (152),  (153),  (160),  (186)  and  (187)  gives  the  following 
results  for  an  inverse  square  type  of  pairing  force  and  the  assumption  of  the 
validity  of  equation  (185)  which  has  firt  -  0 


0^  «  30,  -  20. 

t  Lz  b 

«Fi  =  3®b  -  ^«Lz 

(195) 

0  «  20,  -  0. 
r  Lz  b 

(196) 
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(197) 

Within  the  approximations  made  in 
constants.  A  further  example  of 
(190)  which  give 

this  section  all  internal  phase  angles  are 
this  is  obtained  from  equations  (188)  and 

0*^'  -  30.  -  40, 
ar  b  Lz 

(198) 
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In  the  special  case  when  equation 
through  (199)  that 

(175)  is  valid  it  follows 

from  equations  (195) 
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6*='  =  e„.  =  9,_/3  =  e,  /2  =  0 
ar  Fi  b  Lz 
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The  intrinsic  signs  of  negative  because  the  intrinsic  sign  of  dp 

is  negative . ^ ^ ® 


For  the  case  of  coherent  spacetime  in  the  radial  direction  of  the  orbiting 
electrons  of  a  Cooper  pair,  equation  (85)  gives  the  magnitude  of  the  relative 
radial  acceleration  of  the  two  electrons  as 


c  c,  c2r,„c  ,2  , 
a  =  r  /t  (E  J  + 
r  r  rt 


(203) 


where  r^  =  constant  magnitude  of  the  relative  separation  of  the  two  electrons, 
t^  =  constant  magnitude  of  the  time  for  the  radial  direction  and  where  E^j-  and  F^j- 
are  given  by  equations  (88)  and  (89).  The  acceleration  of  an  electron  relative 
to  the  center  of  mass  of  the  electron  pair  is  given  by  3^/2  .  The  value  of  t^ 
can  be  taken  to  be  a  characteristic  time  of  the  electron  pair  system  -  the  Bohr 
time  tg  of  the  electron  pair  orbiting  a  lattice  charge  of  Z  =  2 

=  tg  =  h^/(m^e^)  =  S/(2lEg|)  (204) 
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where  R  =  h/ (2v)  ,  h  =  Planck's  constant.  Eg  =  -  e  /(Za^)  =  energy  of  an  elec¬ 
tron  in  the  ground  state  Bohr  orbit,  ao  =  Bohr  radius,  and  mg  =  mass  of  the 
electron.  The  factor  of  2  in  equation  (204)  results  from  the  product  of  a  fac¬ 
tor  of  4  which  results  from  =  4  and  a  factor  of  1/2  which  results  from  the 
reduced  mass  of  the  electron  pair  Uq  =  mg/2  .  Then  the  equation  of  motion  for 
the  electron  pair  with  coherent  spacetime  coordinates  in  the  radial  direction 
and  incoherent  spacetime  coordinates  in  the  azimuthal  direction  is  obtained  from 
equation  (137)  to  be 

where  pg  =  m^/Z  =  reduced  mass  of  the  electron,  and  where  o)  =  d4>/dt  =  incoherent 
angular  speed  of  the  electrons.  For  the  case  of  a  weak  electron-electron  inter¬ 
action  force  '\y  0  ,  and  equation  (205)  becomes 

0)  =  t~^(E^t>^  +  (206) 


The  incoherent  angular  speed  of  the  electron  pair  required  for  equilibrium  is 
therefore  independent  of  the  relative  distance  between  the  two  electrons, 
and  represents  non-Keplerian  motion.  For  comparison  equation  (122)  represents 
Keplerian  motion.  Equation  (206)  is  only  valid  for  a  weak  pairing  force. 


The  equilibrium  radius  a^  of  the  orbit  of  the  electrons  in  a  Cooper  pair 
is  obtained  from  equations  (134),  (204)  and  (206)  as 


a  =  i  /  (m  ui) 
c  z  e 

=  £  t^m"^[(E^  + 

z  B  e  rt 


2-lM 

rt 


(207) 


where  -  angular  momentum  of  an  electron,  where  =  total  angular  mo¬ 

mentum  of  the  two  electrons  given  by  equation  (134),  and  where  a^  =  x^H  where 
r^  =  equilibrium  distance  between  the  two  electrons  in  radial  coherent  spacetime 
Equations  (206)  and  (207)  show  that  both  m  and  a^  are  independent  of  the  nature 
of  the  weak  pairing  force  Fj_  .  The  angular  momentum  of  an  electron  is  quantized 
in  the  usual  manner^ ^ 


L  =  mfi 
z 


(208) 


where  m  =  magnetic  quantum  number  which  can  take  the  values  m  =  0  ,  ±1  ,  ±2  ,  ±3  ,  •  • 
Combining  equations  (207)  and  (208)  gives  the  quantized  circular  orbits  of  the 
electron  in  a  Cooper  pair  of  a  high-Tg  superconductor  as 


a^  =  m^it^m  ^[(E^  )^  + 
cm  B  e  rt 


(209) 


where  a^^^  =  orbit  radius  corresponding  to  the  magnetic  quantum  number  m. 

c  c 

For  high-Tg  superconductors  Ej-g  and  Fgg  can  be  obtained  from  equation 
(186)  so  that  equations  (206)  and  (209)  become  respectively 
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(210) 


-If  m1/2 

w  -  tg  [a^(l  -  a^)l 

2  —I  —1  /2 

®cin  “  ‘  ^^ll) 

Combining  equations  (193),  (210)  and  (211)  gives 

(0  *  «  »^/3m  e^/R^ 

B  e 

a^  =  3//2mJit_m  ^  =  3//2mfi^/(m  e^)^ 
cm  Be  e 

Equation  (213)  gives  the  radius  of  the  m*th  orbit  as 

a™  =  (3//2)^/2m^/2a  (214) 

cm  o 

=  1.456m^^^a 

o 

where  the  Bohr  radius  Sq  is  given  by^* 

Sq  =  (215) 


(212) 

(213) 


The  energy  of  a  bound  electron  in  the  pair  can  be  obtained  from  equation  (214)  as 

=  -  0.343m'^^V/a  =  0.687m"^^^E„ 

o  B 

2 

where  m  =  magnetic  quantum  number  and  Eg  =  -  e  /(Zap)  is  the  energy  of  an  elec¬ 
tron  in  a  Bohr  orbit.  For  compaiison,  the  standard  expressions  for  the  radius 
and  energy  of  a  Bohr  atom  with  Z  =  1  is  given  by 

a  =  n\  (217) 

n  o 

E  =  -  e^/(2a  )  =  -  e^/(2n^a  )  (218) 

n  n  o 


where  n=  1,2,3,'**  is  the  principal  quantum  number. 

3.  THERMIONIC  EMISSION  FROM  HIGH-Tc  MATERIALS.  Thermionic  emission  re¬ 
fers  to  electrons  emitted  from  the  surface  of  solids  that  are  heated  to  some 
specified  temperature.  This  phenomenon  is  essentially  a  tunneling  process  be¬ 
cause  there  is  an  energy  barrier  of  a  few  eV  that  prevents  most  of  the  electron 
gas  in  a  solid  from  escaping . ^ ^  At  a  finite  temperature  however  some  elec¬ 
trons  in  the  solid  have  sufficient  energy  to  penetrate  the  barrier  and  escape 
from  the  solid  surface.  The  thermionic  emission  from  high-Tc  superconductors 
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is  relatively  small  but  it  contains  significant  information  about  the  electrons 
that  occur  in  weakly  bound  Cooper  pairs.  Therefore  it  is  important  to  develop 
a  theory  of  the  thermionic  emission  from  the  normal  and  superconducting  states 
of  high-Tj.  materials  and  to  compare  the  predicted  thermionic  emission  current 
with  measured  values.  In  this  way  the  values  of  the  internal  phase  angles  of 
the  electron  momenta  and  the  internal  phase  angles  of  the  space  and  time  coor¬ 
dinates  of  the  electrons  can  be  determined,  and  therefore  the  degree  of  coher¬ 
ence  of  spacetime  can  be  determined  for  hlgh-Tc  materials. 

A.  Rlchardson-Dushman  Thermionic  Emission  Equation. 

Before  considering  the  thermionic  emission  from  high-Tc  materials  a  brief 
review  is  given  of  the  standard  theory  of  thermionic  emission  from  ordinary  met¬ 
als.  The  conventional  picture  of  electrons  in  a  metal  is  that  they  form  a 

Fermi  gas  whose  distribution  function  is  given  by^^~^® 

f  -  {expKe  -  y)/(kT)l  +  (219) 

where  e  ■  kinetic  energy  of  an  electron,  y  ■*  chemical  potential,  k  ■  Boltzmann 
constant  and  T  *  absolute  temperature.  The  electron  kinetic  energy  is  written  as 

e  -  (P^  +  Py  +  (220) 

where  p^  ,  Py  and  components  of  electron  momentum,  and  where  m^  electron 

mass.  The  x  direction  is  taken  as  the  coordinate  axis  that  is  normal  to  the 
surface  of  the  metal.  For  the  electrons  Involved  in  thermionic  emission,  the 
kinetic  energy  per  electron  must  be  greater  than  a  critical  value  given  by 

<“1) 

where  ,  Pxk  '^xk  “  critical  value  of  the  electron  kinetic  energy,  critical 
electron  momentum  normal  to  the  surface,  and  critical  electron  velocity  normal 
to  the  surface  of  a  metal,  e  =  electron  charge,  and  =  work  function  such  that 
e<fr  ■  energy  required  to  remove  an  electron  which  is  at  the  top  of  the  Fermi  sea 
of  electrons.  Therefore  the  critical  value  of  the  electron  momentum  normal  to 
the  metal  surface  that  is  needed  to  just  eject  an  electron  from  the  metal  sur¬ 
face  with  zero  velocity  is  given  by 

p^j^  =  [2m^(y  +  e((i)]^^^  (222) 

For  metals  the  work  function  is  of  the  order  of  <j)  >  1  volt,  so  that  in  general 
for  thermionic  emission  equations  (220)  through  (222)  give 

E  >  e,  »  y  (223) 

k 

and  therefore  as  far  as  the  thermionic  electrons  are  concerned  the  distribution 
function  given  in  equation  (219)  can  be  written  as 

f  “v  exp[-  (t  -  y)/(kT)]  (224) 

=  exp[-  (p^  +  Py  +  ~  2m^y)/(2m^kT)] 
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(225) 


The  electron  number  density  of  a  Fermi  gas  is  written  as^^~^^ 

dn  -  f  dn  -  2/h^  f  dp  dp  dp 
e  p  '^x  *^y 

where  n^  ■>  electron  number  density  and  Op  number  density  of  momentum  states. 

The  thermionic  emission  current  can  be  written 

Ijj  “  (226) 

00  OO  00 

=  2e/(m^h^)  /  /  /  p^f  dp^dp  dp^ 

•00  —OO  n  ^ 

Pxk 

=  2e/(m  h^)J  J  J 
6  X  y  z 

where 


-  /  exp(-  (p^  -  2m^M)/(2m^kT)]p^dp^  (227) 

Pxk 
00 

J  =  /  exp[-  p2/(2m^kT)]dp  (228) 

*'  •00  ^  ^ 

OO  "■ 

J  =  /  exp(-  p^/(2m  kT)]dp  (229) 

«  2  Z 

•00 

The  integrals  can  be  evaluated  from  tables  with  the  result  that^® 

J  =  m  kT  exp[-  e<t)/(kT)]  (230) 

X  6 

1/2 

J  =  J  =  (27Tm  kT)  '  (231) 

y  z  e  ' 


where  the  term  -  e(t)/(kT)  in  equation  (230)  comes  from  the  lower  limit  of  inte¬ 
gration  Pxk  in  equation  (222)  and  (227).  Combining  equations  (226),  (230)  and 
(231)  gives  the  well  known  Richardson-Dushraan  thermionic  emission  equation^ ^ 

1^  =  exp(-  e4>/(kT)]  (232) 

where 

A  =  4Trm  ek^/h^  (233) 

o  e 

is  the  Richardson-Dushman  constant. 

B.  Thermionic  Emission  from  High-Tc  Materials. 

Because  the  factor  6/tt  ,  which  arises  from  the  assumption  of  coherent  time. 
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is  responsible  for  the  relatively  high  values  of  the  normalized  superconducting 
energy  gap  given  by  equation  (3),  it  is  reasonable  to  assume  that  the  electrons 
in  the  Cooper  pairs  are  weakly  bound. Therefore  as  a  first  approximation 


to  calculating  the  thermionic  emission  current  for  high-Tc  materials  it  will  be 
assumed  that  the  electrons  form  a  noninteracting  Fermi  gas  and  that  the  basic 
ideas  of  the  Richardson-Dushman  calculation  can  be  utilized.  For  the  normal 
state  of  a  high-Tf.  material  the  electrons  are  assumed  to  move  in  a  partially 
coherent  spacetime.  Then  the  complex  number  generalization  of  equations  (219) 
through  (225)  are^"* 

f  =  {exp[(e  -  ii)/(kT)]  +  l}  ^ 

(234) 

exp[-  (i  -  ij)/(kT)] 

=  exp[-  (p^  +  p^  +  p^  -  2m  ii)/(2m  kT)  ] 

X  y  z  e  e 

e  =  (p^  +  p^  +  p^)/(2m  ) 

*^x  *^y  e 

(235) 

'k  ■  -  S  +  »♦ 

(236) 

P^k  “  [2ing(P  + 

(237) 

dn  =  fdn  =  2/h^  f  dp  dp  dp 
e  p  *^x  ^y  ^z 

(238) 

where  ^  ,  py  and  pz  are  represented  as  in  equation  (18)  and  where  p^i^ 

^  are  written  as 

,  il  and 

Pxk  °  Pxk  “P^^pxk* 

(239) 

U  =  M  exp(j6^) 

(240) 

^  exp(j9^) 

(241) 

The  values  of  Pj^j^  and  9pxk  obtained  from  equation  (237)  by  writing 

^xk  “'P^^^^pxk^  "  exp(3®p)  +  exp(j0^)] 

whose  real  and  imaginary  parts  are 
2 

p  ,  cos(26  '  =  2ra  (u  cos  0  +  e(J>  cos  0.) 

‘^xk  pxk  e  M  (j> 

p^,  sin(20  ,  )  =  2m  (u  sin  0  +  e(^  sin  0.) 

‘^xk  pxk  e'  u  ^ 

Equations  (243)  and  (244)  give 

tan(20  ,  )  =  (p  sin  0  +  e(|)  sin0^)(u  cos  0  +  e(|)  cos  0.)  ^ 

9  999  1/2 

p  ,  =  2m  [u  +  e  <J)  +  2iJe<J)  cos(9  -  9.)] 

*^xk  e  U  ip 


(242) 

(243) 

(244) 

(245) 

(246) 
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The  momentum  constant  appears  as  a  lower  integration  limit  in  the  complex 
Integral  that  describes  thermionic  emission. 


The  complex  number  thermionic  emission  current  is  given  by 

I  •  efv  dfi  ■  e/m  fp  dn 
X  •'  X  e  '  ef  *^x  e 


z 


where 

00 

\  '"J  (Px" 

Pxk 

jy  “  /  ®*Pf"  Py/(2mgkT)]dpy 

-  “  2 

=  /  expt-  p^/(2mgkT)]dp^ 


(247) 


(248) 

(249) 

(250) 


The  complex  number  thermionic  current  in  equation  (247)  can  be  written  as 
'x  ■  ^x  “>’<->®Ix> 

and  the  integrals  in  equations  (248)  through  (250)  can  be  written  as 

exp(jej^)  (252) 

jy  =  Jy  exp(jejy)  (253) 

Comparing  equation^  (247)  and  (251)  through  (254)  gives  the  thermionic  emission 
current  as 


I  =  2e/(m  h  )J  J  J 
X  6  X  y  z 

=0^  +0-  +  0, 
Ix  Jx  Jy  Jz 


(255) 

(256) 


The  measured  thermionic  emission  current  is  given  by  the  real  part  of  equation 
(251)  as 


I  =  I  cos  0^ 
xm  X  Ix 


2e/(m  h  )J  J  J  cos(0^  +6.  +  e_  ) 

'e  xyz  Jx  Jy  Jz 


(257) 


Therefore  the  problem  at  hand  is  to  calculate  Jx  .  Jy  .  Jz  and  6 ,  0jy  i  6 .  In 
order  to  do  this  the  real  and  imaginary  parts  of  the  integrals  in  equations 
(2A8)  through  (250)  must  be  calculated. 

The  Integrals  in  equations  (248)  through  (250)  can  be  rewritten  using 
equations  (24)  and  (25)  as 


J  »  j  p  J  dp 
X  X  px  *^x 

Pxk 

00 

=  /  p  J  sec  6  exp[j(e_  +26  +6  )ldp 

X  px  pxpx  jpx  px  pxpx  ^  *^x 

Tr/6  _ 

=  /  p  J  CSC  6  exp[j(0.  +26  +8  )]d6 

■'  "^x  px  pxpx  jpx  px  pxpx  px 

pxk 


3  =  /  3  dp 
y  py 


=  /  J  sec  8  exp(j(0^  +6  +6  )ldp 

py  pypy  Jpy  py  pypy  y 

7r/6 

=  /  p  J  CSC  8  exp[i(6-  +6  +8  )]d6 

i  py  pypy  Jpy  py  pypy  py 


(258) 


(259) 


(260) 


(261) 


(262) 


(263) 


3  =  /  3  dp 

z  pz  *^z 

—00  * 

OO 

=  /  J  sec  8  exp[j(0T  +6  +8  )ldp 

pz  pzpz  jpz  pz  pzpz  *^z 

Tr/6 

=  /  p  J  CSC  8  exp[j(6,  +6  +8  )]d6 

^  *^z  pz  pzpz  Jpz  pz  pzpz  pz 

where 

^px  =  V  ®*P^J®Jpx^  '  ^Px  ■  2m^i^)/(2“ekT)] 

^py  “  “^py  ®*P^^®Jpy^  =  Py/<2n>/T)] 

V  “  V  “  “P^"  Pz/(2m^kT)] 


(264) 

(265) 

(266) 

(267) 

(268) 
(269) 
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exp{-  tPjj  ^08(26^^)  -  2bi^u  cos  e^]/(2m^kT)} 

exp{“  [Py  cos(2epy)]/(2m^kT)} 

exp{“  [pi  cos(20  )]/(2m  kT)} 

«•  ^ 


8,  -  -  [p  sin(20  )  -  2m  u  sin  0  ]/(2in  kT) 

Jpx  px'  u'  '  e  ' 


®jpy  “  -  tPy  sin(20py)]/(2m^kT) 


(270) 

(271) 

(272) 

(273) 
(27A) 
(275) 


The  lower  limits  of  integration  p,^^  ,  p^]^^  and  0mk  integrals  of  equations 

(258)  through  (260)  are  given  by  equations  (237),  (245)  and  (246).  The  upper 
limits  of  the  integrals  over  the  phase  angles  of  the  momenta  in  equations  (260), 
(263)  and  (266)  are  obtained  from  the  assumption  of  spacetime  coherence  which 
gives“*“«^“ 


0*=  =  0^^  -  0^  -  ir/3  -  ir/6  -  ir/6 

px  X  t 

with  similar  expressions  for  the  y  and  z  directions. 


(276) 


The  quantities  -Jx  •  ®jx  *  "^y  »  ®Jy  ♦  *^Jz  that  are  required  for  the  de¬ 

termination  of  the  measured  thermionic  emission  current  given  in  equation  (257) 
can  be  obtained  by  first  calculating  the  real  and  imaginary  parts  of  the  inte¬ 
grals  given  in  equations  (259),  (260),  (262),  (263),  (265)  and  (266)  as  follows 


J  p  J 

sec  3 

cos(0^  + 

20 

+ 

3 

)dp 

(277) 

ti/6 

pxpx 

Jpx 

px 

pxpx  ’X 

/  P^J 

CSC  3 

COS(0T  + 

20 

+ 

3 

)d0 

(278) 

%xk 

pxpx 

Jpx 

px 

pxpx 

px 

J  sin  0^  =  f  p  J  sec  6  sin(9T  +20  +  B  )dp 

X  Jx  X  px  pxpx  Jpx  px  pxpx  '^x 


p  X  px 
^xk 


=  r  p  J  CSC  6  sin(0^  +20  +3  )d0 

„  X  px  pxpx  Jpx  px  pxpx  px 


J  cos  0.  =  /  J  sec  6  cos(0^  +0  +3  )dp 

y  Jy  „  py  pypy  Jpy  py  pypy  y 


=  /  p  J  CSC  3  cos(9^  +0  +3  )d0 

o  y  py  pypy  Jpy  py  pypy  py 


(279) 


(280) 


(281) 


(282) 
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(283) 


J  sin  0-  *  f  J  sec  8  sin(8,  +0  +  S  )dp 

y  Jy  py  pypy  Jy  py  pypy  y 


f  p  J  CSC  8  sln(0  -  +  0  +8  )d0 

i  y  py  pypy  Jy  py  pypy  py 


(284) 


J  cos  0,  ■  /  J  sec  8  cos(0-  +0  +8  )dp 

z  Jz  ''  pz  pzpz  Jpz  pz  pzpz  *^z 


/  p  J  CSC  8  cos(0.  +0  +8  )d0 

^  '^z  pz  pzpz  Jpz  pz  pzpz  pz 


(285) 

(286) 


J  sin  0^  *  /  J  sec  8  sln(6  +0  +8  )dp 

z  Jz  ''  pz  pzpz  Jpz  pz  pzpz 


/  p  J  CSC  8  sinCS-  +0  +8  )d0 

^  *^z  pz  pzpz  Jpz  pz  pzpz  pr 


(287) 

(28&> 


These  Integrals  will  be  evaluated  for  special  cases  in  the  fallowing  secCioos. 

C.  Thermionic  Emission  from  the  Normal  State  of  a  High-Tc  Material. 

Consider  the  case  of  the  normal  state  of  a  hlgh-Tc  material  with  I  >  Tc 
for  which  the  electrons  are  in  a  nearly  Incoherent  spacetime  state  which  is 
described  by 


8  0 
yy 

8  '^0 
zz 

(289A) 

6«t-0 

8^  -v-O 
^tt 

^tt  "  0 

(289B) 

8  ''-0 
pxpx 

8  -v-  0 

pypy 

8  -v  0 

pzpz 

(290) 

which  corresponds  to 

w* 

0^  =  constant 

X 

0^  =  constant 

y 

0^  =  constant 
z 

(291) 

0*^  =  constant 

vi 

0^  =  constant 

0^^  =  constant 
t 

(292) 

From  equations  (23), 

(289),  (291)  and 

(292)  it  follows  that 

px  X  t 

=  constant 

(293) 

0^  =■  0^  -  0^^ 
py  y  t 

=  constant 

(294) 

0^  *  0^  -  0^^ 
pz  z  t 

=  constant 

(295) 

(296) 


Then  the  integrals  in  equations  (277),  (279),  (281)  and  (283)  become 


cos  0^  -  W,  cos(2e*  )  +  W,  sin(2e^  ) 

X  Jx  Lx  px  2x  px 

■"x  -  "lx  -  “2x 


y 

y 


cos 

®Jy 

“  ”iy 

cos 

0 

py 

+  U- 
2y 

sin 

0 

py 

sin 

®T 

Jy 

“  ”iy 

sin 

0^ 

py 

-  u, 

2y 

cos 

0^ 

py 

cos 

0^ 

*  u, 

cos 

0 

+  U- 

sin 

0 

z 

Jz 

Iz 

pz 

2z 

pz 

sin 

0^ 

“  u, 

sin 

0^ 

-  Uo 

cos 

0^ 

z 

Jz 

Iz 

pz 

2z 

pz 

where 


W,  “  /  P  cos  0^  dp 

lx  •  Tnv 


■xk 


x  px 


Jpx  *^x 


00 

"2x  ■  -  /  Px-'px  ®jpx  ‘•'’x 


(297) 

(298) 

(299) 

(300) 

(301) 

(302) 

(303) 


"^xk 


U 


ly 


/  cos  0^  dp 

py  dpy  y 


(304) 


u 


2y 


w 

-  f  sin  6^ 


py 


Jpy 


(305) 


U 


u 


Iz 

2z 


=  /  cos  9^  dp 
J  pz  Jpz  z 


=  -  /  sin  9^  dp 
_oo  Jpz  z 


where 


J 

J 

J 


i 

px 

i 

py 

i 

pz 


exp(g  -  c^p^) 
exp(-  CyPy) 
exp(-  c^p2) 


(306) 

(307) 

(308) 

(309) 

(310) 
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-  o  K  2 

Jpx  x*^x 

(311) 

8 .  “-bp 

Jpy 

(312) 

0 ,  "-bp 

Jpz  z*^z 

(313) 

and 

where 

-  cos(2e^^)/(2mgkT) 

(314) 

Cy  -  cos(2e^y)/(2m^kT) 

(315) 

c,  =  cos(20^  )/(2m  kT) 
z  '  pz''  '  e 

(316) 

b^  -  sin(20j^)/(2m^kT) 

(317) 

by  -  sin(20^y)/(2mgkT) 

(318) 

b  =  sin (20^  )/(2m  kT) 
z  pz  e  ' 

(319) 

g  »  (p  cos  0^)/(kT) 

(320) 

a  -  <p  sin  0^)/(kT) 

(321) 

The 

integrals  in  equations  (302)  through  (307)  will  now  be  evaluated. 

Combining  equations  (302)  through  (321)  allows  the  integrals  in 
(302)  through  (307)  to  be  written  as 

equations 

"lx  ■  1  -  V^Px-'Px 

Pxk 

(322) 

00 

Wjx  -  -  /  «P<8  -  SPx>Pl"<^  -  '>xP^Px‘‘Px 
^xk 

(323) 

00 

2  2 

U,  =  f  exp(-  c  p  )cos(b  p  )dp 

ly  _  «,  y  y  y  y  y 

(324) 

oo 

U-  =  /  exp(-  c  p^)sin(b  p^)dp 

^y  -00  y  y  y  y  y 

(325) 

00 

2  2 

—  00 

(326) 

00 

2  2 

U2z  =  /  exp(-  c^p^)sin(b^p^)dp^ 

(327) 

—00 


473 


The  integrals  in  equations  (322)  and  (323)  can  be  written  as 


w. 

lx 

•  B(Uj^  cos  a  +  sin  a) 

(328) 

w 

2x 

•  B(U2x  cos  a  -  sin  a) 

(329) 

where 

«lx 

00 

2  2 

-  /  exp(-  CxPx^‘^°®^^xPx^Px‘*Px 

Pxk 

(330) 

"2x 

GO 

2  2 

»  /  exp(-  c^p^)sin(b^p^)p^dp^ 

(331) 

Pxk 

where 

B  -  e®  (332) 

where  g  is  given  by  equation  (320) . 

The  evaluation  of  the  integrals  Ujy  and  U2v  for  v  ■»  x  ,  y  ,  z  can  be  ob¬ 
tained  from  integral  tables. The  results  are  as  follows 


“lx  ■  '’xPxk> 

"2x  ‘  "e"  '=x'’xk>*^"<^®Jx  *  '’x'’L> 

"lx  ■  "e“  +  Vxk  - 

«2x  -  ".kT  “P(8  -  'x^xk'®^"'^®™  +  '’x^xk  - 
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w,  - 

lx 

m^kT  exp{(M  -  p^j^/(2m^)  ]/(kT)  } 

(343) 

w°  - 

2x 

0 

(344) 

(2xmgkT)^^^ 

(345) 

“2P- 

0 

(346) 

"?  - 
Iz 

(2Trm  kT)^^^ 
e 

(347) 

= 

2z 

0 

(348) 

The  quanticles  Ujy  ,  U2v  »  and  W2x  enter  directly  into  the  calculation  of 
the  thermionic  emls'^ion  current. 

The  calculation  of  the  factors  and  their  associated  internal  phase 
angles  0jy  for  v  =  x  ,  y  ,  z  that  appear  in  eqiiations  (255)  through  (257)  for  the 
thermionic  emission  current  from  the  normal  state  follows  from  equations  (296) 
through  (301)  using  the  results  presented  in  equations  (333)  through  (340)  as 


sin  0j^  =  m^kT  exp(g  -  "  ^x^xk^ 

cos  0^  =  (2Trm  kT)^^^  (351) 

y  Jy  e 

sin  0^  =  0  (352) 

y  Jy 

cos  0^  =  (2iim  kT)^^^  (353) 

z  Jz  e 

sin  0^  =  0  (354) 

z  Jz 


Equations  (349)  through  (354)  give 
/  .  m^kT  exp(g  - 


-  '>x''xk 


(355) 

(356) 


=  (2Trm  kT)^^^ 


(357) 


(359) 


-  (2Tim 
z  '  e 

ej^  -  0  (360) 

Then  the  thermionic  emission  current  for  the  normal  state  of  a  high-Tc  material 
is  obtained  from  equations  (257)  and  (355)  through  (360)  to  be 

where  is  given  by  equation  (233),  and  g  ,  a  ,  c^  ,  b^  and  pxk  are  given  by  equa¬ 
tions  (320),  (321),  (314),  (317)  and  (246)  respectively.  When  all  internal 
phase  angles  are  set  equal  to  zero  equation  (361)  reduces  to  the  Richardson- 
Dushman  equation  (232)  because  for  this  case 


g°  «  u/(kT) 

(362) 

c°  -  (2m^kT)"^ 

(363) 

-  [2m^(p  +  e^)]^^2 

(364) 

a°  -  0 

(365) 

o' 

X  o 

II 

o 

(366) 

(367) 

which  corresponds  to  incoherent  spacetime. 

The  predicted  measured  v  lue  of  the  thermionic  emission  current  for  the 
normal  state  of  a  high-T^  material  is  given  by  equation  (361)  within  the  ap¬ 
proximation  of  constant  i;;ternal  phase  angles  for  the  single  particle  momenta, 
i.e.,  0px  *  ,  0py  =  6p  and  Op^,  =  0^  .  -phe  predicted  measured  value  of  the 

thermionic  emission  current  is  seen  to  have  a  leading  temperature  term  of  T^ 
that  is  augmented  by  a  temperature  dependent  exponential  and  trigonometrical 
term.  For  the  case  of  an  ordinary  metal  where  all  internal  phase  angles  are 
set  equal  to  zero  equation  (361)  reduces  to  the  standard  Richardson-Dushman 
equation  (232) • 

D.  Thermionic  Emission  from  the  Superconducting  State  of  a 
High-T  ^rial. 


The  following,  conditions  are  valid  for  the  coherent  spacetime  of  the  su¬ 
perconducting  state  of  a  high-Tj.  material 


6 


c 

XX 


6 


XC 

tt 


■n/2 

6^^  =  tt/2 

s'"  =  71/2 

(368A) 

yy 

zz 

tt/2 

^11  =  W2 

(368B) 
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It  is  further  assumed  that  equations  (106)  and  (107)  are  valid  for  the  super¬ 
conducting  state  of  a  high-Tc  material.  Then  according  to  equation  (108)  the 
following  coherence  conditions  are  valid  for  incipient  unbound  electron  pairs 
from  which  come  the  electrons  of  thermionic  emission  from  the  superconducting 
state 

“  ir/2  8*^"*"  ■  ir/2  8*^"*"  ■  Tr/2  (3691 

DXDX  nvnv  '  KJOy) 


so  that  equation  (115)  gives  with  Spjj  ,  Spy  and  Op^  as  variables 

dp*^"*^  =  “  jp  d0  exp(j6  ) 

X  •''^x  px  ''*^xc  px  px' 

dp^"*”  =  jp*^de  *  jp  d6  exp(j6  ) 

y  y  py  *^yc  py  py' 

dp*^^  =  jp*^de  *=  jp  de  exp(j6  ) 

*^z  pz  •^‘^zc  pz  pz 


(370) 


(371) 


(372) 


and  where 


p  =  2m  kT 
xc  e  cx 


(373) 


p  =  2m  kT 
yc  e  cy 


(374) 


p  =  2m  kT 
zc  e  cz 


where 


T^^  =  superconducting  transition  temperature  for  ac  plane 
T^y  =  superconducting  transition  temperature  for  be  plane 
T^^  =  superconducting  transition  temperature  for  ab  plane 


(375) 


For  a  bulk  superconductor  the  .superconducting  transition  temperature  is  given  by 


2  2  2 

P  ~P  “P  =2mkT 

xc  *^yc  zc  e  c 

T  =  T  =  T  =  T 

cx  cy  cz  c 


(376) 

(377) 


where  T^.  is  the  common  value  of  the  superconductivity  transition  temperature. 

The  complex  number  thermionic  emission  current  is  given  by  equations 
(247),  (260),  (263)  and  (266)  for  the  superconducting  state  of  a  high-Tj.  mate¬ 
rial,  where  for  this  case 


=  jp^  f  J  exp[j(0^  +  20  )]d0 

X  '^‘^xc  .  ■*  px  Jpx  px  px 

®pxk 


(378) 
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ir/6 


J®  •  jp  f  J  exp[j(6.  +  6  )ld0 

y  yc  ^  py  ^  Jpy  py  py 

ir/6 

■  Jp  /  J  exp(j(e,  +  e  )]d0 

z  ■*‘^zc  '  pz  Jpz  pz  ^  pz 


(379) 

(380) 


where 
through 
are  given  by 


Jpx  »  Jpy  •  Jpz  »  ®Jpx  *  ®Jpy  ®Jpz  equations  (270) 

h  (275).  The  real  and  imaginary  parts  of  equations  (378)  through  ( 

ii/6 


J  cos  6^  “  -  p  /  J  sin(6  +  26  )d6 

X  Jx  ‘^xc  -  px  '  Jpx  px'  px 

®pxk 


ir/6 


sin  6*:  =  p^  /  J  cos(e.  +  20  )d0 

X  Jx  ^xc  .  px  '  Jpx  px-'  px 

”pxk 


(380) 

(381) 

(382) 


■a/6 


J  cos  0^  “  “  P  /  J  sin(0-  +  0^„)d6 

y  Jy  yc  i  py  Jpy  py  py 


a/6 

J*^  sin  0*1  =  p  /  J  cos(6t  +6  )d6 

y  Jy  yc  i  py  Jpy  py  py 


(383) 

(384) 


a/6 


J  cos  0^  =  -  P  f  J  sin(6_  +  6  _)d6 

z  Jz  *^zc  ^  pz  Jpz  pz'  pz 


a/6 
o  P2 


J*^  sin  0^  =  p  /  J  cos(0^  +  6  )d6 

•»  Jz  zc  A  pz  Jpz  pz  pz 


(385) 

(386) 


c  c  c  c  c  c 

The  problem  is  to  determine  J  ,  J  ,  J  .  6  ®  t  ®  t 

X  ^  Z  JX  J  2* 

Combining  equations  (270)  through  (275)  with  equations  (381)  through 
(386)  gives 


X 

y 

i*^ 

y 

z 

z 


cos 

2  g 

p  e®(K,  cos  a  -  K_  sin  a) 
xc  1  2 

(387) 

sin 

CD 

II 

p^  e®(K,  sin  a  +  K_  cos  a) 
xc  1  2 

(388) 

cos 

®T  “ 

P  K-i 

(389) 

Jy 

*^yc  3 

sin 

®T  = 

P  K/ 

(390) 

Jy 

*^yc  4 

cos 

P  K. 
zc  5 

(391) 

sin 

= 

P  K 

zc  6 

(392) 
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(393) 


where  g  and  a  are  given  by  equations  (320)  and  (321),  and  where 

it/6 

t,  ^  f  exp(-  a  cos(20^)]8in[o  8in(26  )  -  20  ]d0 

1  ^  xc  px''*  xc  px'  px-*  px 


K-  ■  /  exp[-  a  cos(20  )]cos[a  sin(20  )  -  20  ]d0 

9  J  vr  nv'  **  vr  '  tw'  nv* 


px  px 


(394) 


K»  «  /  exp[-  a  cos (20  )]sin[a  sin(20  )  -  0  ]d0 

^  J  vr  '  nv'  •*  *“  vr  '  nv'  nv* 


py  py  py 


(395) 


K,  -  f  exp[-  a  cos(20  )]cos[a  sin(20  )  -  0  ]d0 


py  py  py 


(396) 


Kc  =  /  exp[-  a  cos(20  )]sin[a  sin(20  )  - 


0  ]d0 

pz  pz 


(397) 


K,  =  f  expf-  a  cos(20  )]cos[a  sin(20  )  - 

ft  J  ^r  '  ny.  yr  '  n^' 


0  ]d0 

pz-"  pz 


(398) 


where 


a  -  p  /(2m  kT)  =  T  /T 
xc  *^xc  '  e  '  cx 


(399) 


a  =  p  /(2m  kT)  =  T  /T 
yc  ‘^yc  e  cy 


(400) 


o  =  p  /(2m  kT)  =  T  /T 
zc  *^zc  e  cz 


(401) 


Equations  (387)  through  (391)  give 

=  p^  e®(Kj  + 

x  xc  1  2 

=  p  (K^  + 

y  *^yc  3  4 

=  p  (K^  + 

z  ‘^zc'  5  6 


(401) 


(402) 


(403) 


tan  =  (Kj  sin  a  +  K2  cos  a)/(Kj  cos  a  -  K2  sin  a) 


(405) 


tan  0 


K4/K3 


(406) 


tan  0 . 


K6/K5 


(407) 
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Equations  (402)  through  (407)  can  be  used  to  calculate  the  thermoemission  from 
the  superconducting  state  of  a  high-T^  compound. 

The  measured  thermionic  emission  current  for  a  high-T^  superconductor  is 
given  by  equation  (257) .  Several  cases  can  be  considered  according  to  whether 
the  spacetime  coordinates  are  coherent  or  incoherent.  In  all  cases  considered 
the  thermionic  emission  is  along  the  x  axis. 


Case  a.  Spacetime  Coherence  Along  All  Axis. 


Equations  (257),  (373)  through  (375)  and  (402)  through  (407)  give  the  mea¬ 
sured  thermionic  emission  current  as 


I*^*^*^  =  2e/(m  h^)j‘^j‘^j‘^  cos(0^  +  +  6^  ) 

xm  e  '  X  y  z  Jx  Jy  Jz 


(408) 


or  equivalently 

I*^*^*^  =  2/irA  T  (T  T  )^^^  e®  G 


xm 


o  cx  cy  cz 


123456  C 


(409) 


where  Aq  is  the  Rlchardson-Dushman  constant  given  by  equation  (233);  »  "^cy 
and  Tcz  are  the  superconductivity  transition  temperatures,  g  is  given  by  equa¬ 
tion  (320) ,  and  where 


^123456 

.ccc 
9  “ 

xyz 


2  2  2  2  2  2 
=  [  (K^  +  Kp  (K3  +  Kp  (K5  +  Kg) 


Jx 


+  0^  +  0 
Jy 


Jz 


1/2 


(410) 

(411) 


c 

where  through  K5  are  given  by  equations  (393)  through  (398),  and  where  , 
0j  and  0j2  are  given  by  equations  (405)  through  (407).  Equation  (409)  gives 
the  measured  thermionic  emission  current  for  the  superconducting  state  and  is 
valid  for 


T  <  T 


cx 


T  <  T 

cy 


T  <  T 

cz 


(412) 


For  the  case  of  a  bulk  superconductor  (bs)  equation  (377)  is  valid  and  equation 
(409)  becomes 


=  2/-rrA  e®  cos  9^® 

xm  o  c  123456  xyz 


where  now 


K-  =  K.  K.  =  K, 

3  5  4  6 

^  +  kJ) 


Jy 


=  0 


Jz 


(413) 


(414) 

(415) 


+  2  0^ 
Jy 


(416) 
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which  is  valid  for  T  <  .  The  leading  tonperature  terma  in  equations  (409)  or 

(413)  are  functions  of  the  superconducting  transition  temperatures  and  are  not 
related  to  the  term  that  appears  in  the  Richardson-Dushman  equation  that  de¬ 
scribes  thermionic  emission  from  ordinary  metals.  The  temperature  dependence 
of  the  thermionic  emission  current  comes  essentially  through  the  functions  eS 
and  K|  through  .  Equations  (320),  (393)  through  (398),  (409)  and  (410)  show 
that 


e 


8 


^123456  "  ° 


T  -»•  0 


(417) 


so  that  there  is  no  thermionic  emission  at  T  =  0 

l‘^‘^‘^(T  =  0)  =  0 
xm 


(418) 


The  functions  Kn(T)  increase  slowly  with  temperature  from  their  values  of 
K^i(O)  »  0  at  T  »  0  . 

Case  b.  Spacetime  Coherence  Along  the  Emission  Axis,  and  Spacetime 
Coherence  and  Incoherence  Along  the  Two  Perpendicular  Axes. 


For  this  case  equation  (257)  is  written  as 

=  2e/(m  h^)j‘^j‘^J^  cos(e^  +  0^  +  6^  )  (419) 

xm  e  X  y  z  Jx  Jy  Jz 

where  »  Jy  and  J2  are  given  by  equations  (402) ,  (403)  and  (353)  while  Ojx  , 
0jy  and  0^2  given  by  equations  (405),  (406)  and  (360).  Equation  (419)  can 
be^ rewritten  as 


=  2/»^A  T  e®  G  cos 

xm  o  cx  cy  1234  xyz 


(420) 


where 

^234  =  [(Ki  +K2^(K3  +  K^]^^^ 


(421) 


<(> 


cci 

xyz 


cci 

0^  +0^  +  9^ 

Jx  Jy  Jz 


®T  + 

Jx  Jy 


(422) 


For  the  case  when  T^x  =  T^cy  =  'Tc  equation  (420)  becomes 


I 


cci 

xm 


2//nA.  e®  G 


o  c 


1234 


cos  <j) 


cci 

xyz 


(423) 


If  the  incoherent  spacetime  axis  is  taken  to  be  the  y  axis  the  thermionic 
emission  current  is  given  analogously  to  equations  (419)  and  (420)  as 
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(424) 


m  2e/(m  cos(0^  +0^  +  0  t  ) 

xm  e'xyz  Jx  Jy  Jz 

-  2/i^A  T  e*  G  cos 

o  cx  cz  1256  xyz 


(425) 


where 


[(Kj  +  K^)(K5  + 


(426) 


Jx  J 


t  +  0; 

Jy  Jz 


(427) 


®  T  +  ®  T 

Jx  Jz 


Again,  for  the  case  T^x 


Tg  equation  (425)  becomes 


-  2//irA  e®  G,,,,  cos 

xm  o  c  1256  xyz 


(428) 


1/2 

which  is  essentially  equivalent  to  equation  (423).  Note  the  leading  T  '  be¬ 
havior  of  the  thermionic  emission  current  in  equations  (420),  (423),  (4?.5)  and 
(428)  which  is  different  from  the  behavior  associated  with  the  Richardson- 
Dushman  equation  for  ordinary  metals.  The  behavior  may  possibly  be  ex¬ 

perimentally  verified  in  the  high-T^  compounds.  Equations  (420)  and  (425)  are 
valid  for  T  <  T^x  »  T  <  Tcy  and  T  <  Tcz  »  while  equations  (423)  and  (428)  are 
valid  for  T  <  Tc  . 

Case  c.  Spacetime  Coherence  Along  the  Emission  Axis,  and  Spacetime 
Incoherence  Along  Both  Perpendicular  Axes. 


In  this  case  equation  (257)  becomes 

I*^^^  =  2e/(m  h^)j‘^J^J^  cos(e^  +8^  +  8^  ) 

xm  e  X  y  z  Jx  Jy  Jz 


(429) 


c  1  i  c 

where  J^^  ,  Jy  and  J^  are  given  by  equations  (402),  (357)  and  (359),  and  where  Sjj^  , 

8^y  and  8j2  are  given  by  equations  (405),  (358)  and  (360).  Equation  (429)  can 

be  rewritten  as 


l'^^^  =  2A  T  T  e®  G,  „  cos 
xm  o  c  12  xyz 


(430) 


where  and  where 

Gi2  =  (kJ  + 


(431) 


c  i  i 

0^  +8^  +  8^ 

Jx  Jy  Jz 


(432) 
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For  this  case  the  leading  temperature  dependent  term  Is  T  which  possibly  could 
be  experimentally  verified  In  special  classes  of  hlgh-Tc  materials.  Equation 
(430)  Is  valid  for  T  <  Tc  • 

Case  d.  Incoherent  Spacetime  Along  the  Emission  Axis,  and 
Spacetime  Coherence  Along  Both  Perpendicular  Axes. 


For  this  case  equation  (257)  is  written  as 
,lcc 


xm 


cos(6j^  +  +  6=^) 


(433) 


where  ,  Jy  and  are  given  by  equations  (355),  (403)  and  (404)  and  0jjj  ,  0jy 
and  0j2  are  given  by  equations  (356),  (406)  and  (407).  Then  equation  (433)  can 
be  rewritten  as 


=  A  e®'  exp 

xm  o  cy  cz  3456  xyz 


where 


g' 


g  -  c  p 
®  x'^xc 


2  2  2  2  1/2 
^3456  =  [(^3  +  V<''5'' VI 


(434) 

(435) 

(436) 


=  0^  +0*:  +0^ 

xyz  Jx  Jy  Jz 


If  T_„  =  Tj.2  =  then  equation  (434)  simplifies  to 


cy 


=  A  /tiT  T  e®'  G  ,  ,  cos 
xm  o  c  3456  xyz 


(437) 


(438) 


Equation  (434)  is  valid  for  T  <  Tcy  and  T  <  Tcz  >  while  equation  (438)  is  valid 
for  T  <  Tc  .  Note  the  linear  dependence  of  the  leading  temperature  dependent 
terms  in  equations  (434)  and  (438) . 


Case  e.  Incoherent  Spacetime  Along  Emission  Axis,  and  Spacetime 

Coherence  and)  Incoherence  Along  the  Two  Perp^nllcular  Axes.^ 

In  this  case  equation  (257)  is  written  as 

I^^^^  =  2e/(m  h^)J^J^j‘^  cos(0^  +0^  +  0 t  >  (^39) 

xm  e  xyz  Jx  Jy  Jz 

where  J^  ,  Jy  and  J^  are  given  by  equations  (355),  (357)  and  (404),  while  0jx  , 
0jy  and  0^2  sre  given  by  equations  (356),  (358)  and  (407).  Equation  (439)  can 
be  rewritten  as 


I 


iic 

xm 


l/2^3/2 

cz 


cos  (|) 


iic 

xyz 


(440) 


where  g'  is  given  by  equation  (435)  and  where 
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®56  ■ 


-  0^  +0^  +0*: 

xyz  Jx  Jy  Jz 


0^  +  0^ 

Jx  Jz 


(4A1) 

(442) 


For  this  case  the  leading  temperature  term  of  the  thermionic  emission  current 
is  T^/2  _  jf  (.jjg  coherent  spacetime  axis  is  selected  to  be  the  y  axis  the 
thermionic  emission  current  is  given  by 


lici 

xm 

(443) 

=  A  //fTT^^V^^  e®'  G.,  cos 

o  cy  34  xyz 

(444) 

where 

=34  - 

(K3  + 

(445) 

- 

xyz 

(446) 

For  this  case  the 


leading  temperature  dependent  term  is 


t3/2  _ 


From  equations  (409),  (413),  (420),  (425),  (430),  (434),  (438),  (440), 
and  (444)  it  is  clear  that,  depending  on  whether  spacetime  is  coherent  or  inco¬ 
herent  along  the  emission  axis  and  along  the  two  perpendicular  axes,  the  lead¬ 
ing  temperature  terms  of  the  thermionic  emission  current  for  the  superconducting 
state  of  a  high-T^.  material  can  be  either  T°  ,  t1/2  ,  T  or  , 


4.  CONCLUSION.  The  superconducting  state  (T  <  Tj.)  of  a  high-Tc  material 
is  associated  with  coherent  spacetime  while  the  normal  state  (T  >  T^)  is  relat¬ 
ed  to  partially  coherent  spacetime.  For  the  superconducting  state  the  Cooper 
electrons  rotate  incoherently  or  coherently  but  their  radial  motion  in  space 
and  time  is  in  a  coherent  state.  The  motion  of  the  electrons  under  a  weak  at¬ 
tractive  inverse  square  force  is  non-Keplerian  because  the  rotational  period  is 
independent  of  the  separation  distance  of  the  two  electrons  in  a  Cooper  pair. 

The  thermionic  emission  currents  for  the  normal  and  superconducting  states  of  a 
high-Tc  compound  have  been  calculated.  For  the  normal  state  of  a  high-Tc  mater¬ 
ial  the  thermionic  emission  current  has  a  Richardson-Dushman  form  that  is  mod¬ 
ified  by  an  exponential  and  cosine  term,  but  for  the  superconducting  state  the 
thermionic  emission  current  has  a  completely  different  form  and  does  not  have 
the  leading  quadratic  temperature  dependence  of  the  Richardson-Dushman  equation 
but  rather  goes  as  T®  ,  T^/2  ^  j  or  T^'^  depending  on  whether  spacetime  is  co¬ 
herent  or  incoherent  along  the  thermionic  emission  axis  and  along  the  two  per¬ 
pendicular  axes. 
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SLOW  AND  ULTRAFAST  DYNAMICAL  SYSTEMS 


Richard  A.  Weiss 

U.  S.  Amy  Engineer  Waterways  Experlnent  Station 
Vicksburg,  Mississippi  39180 

ABSTRACT .  Newton's  law  of  dynamics  is  written  in  a  complex  number  fom 
that  is  valid  for  space  and  time  coordinates  that  exhibit  broken  internal 
symmetries.  The  space  and  time  coordinates  and  therefore  also  the  potential 
function  are  written  as  complex  numbers  in  an  internal  space.  This  allows  the 
possibility  of  motion  in  space  and  time  where  the  magnitudes  of  the  complex 
number  space  and  time  coordinates  change  «ihile  the  phase  angles  reoiain  constant, 
and  the  possibility  of  internal  space  motions  where  the  phase  angles  of  the  space 
and  time  coordinates  vary  while  the  magnitudes  of  the  space  and  time  coordinates 
are  constants  which  corresponds  to  the  case  of  internal  rotations  in  spacetime. 
Similarly  the  magnitude  of  a  complex  number  potential  can  change  while  the 
internal  phase  angle  of  the  potential  remains  fixed  as  in  the  case  a  a  slow 
mechanical  process,  or  the  internal  phase  angle  of  the  complex  nusd>er  potential 
can  change  for  a  fixed  magnitude  of  the  potential  which  is  the  case  of  an 
ultrafast  mechanical  process.  This  leads  to  eight  possible  forms  for  Newton's 
dynamical  law  of  motion  corresponding  to  slow  and  ultrafast  mechanical  processes 
in  coherent  or  incoherent  space  and  for  coherent  and  incoherent  time. 

1 .  INTRODUCTION .  Classical  mechanics  is  formulated  against  a  background 
of  space  and  time  which  is  represented  as  an  inert  continuous  medltan.  Bodies  in 
motion  are  represented  as  trajectories  in  the  space  and  time  background.  In  this 
context  space  and  time  is  a  mathematical  construct  that  itself  has  no  reality 
other  than  geometry  because  space  and  time  coordinates  can  be  chosen  in  an 
arbitrary  manner.  However,  the  development  of  quantum  field  theory  has  suggested 
that  the  vacuum  is  a  real  physical  medixim  having  well  defined  properties. The 
vacuum  state  influences  the  matter  within  the  vacuum  through  such  effects  a 
vacuum  polarization  and  the  self  energy  of  fundamental  particles.^"® 

The  reality  of  the  vacuum  state  suggests  that  space  and  time  coordinates 
may  be  more  than  just  mathematical  constructs,  and  it  has  been  suggested  that 
space  and  time  coordinates  are  complex  numbers  in  an  internal  space.*  The  nature 
of  the  complex  number  coordinates  has  its  origin  in  the  relativistic  trace 
equation  which  implies  that  pressure  is  a  complex  number  in  an  internal  space. 
The  internal  phase  angles  of  the  space  and  time  coordinates  are  affected  by  the 
presence  of  external  fields  such  as  gravity  and  electromagnetism.*  The  concept 
of  complex  ntimber  spacetime  allows  the  possibility  of  coherent  motion  in 
spacetime  wherein  the  complex  number  coordinates  rotate  in  an  internal  space  with 
fixed  magnitudes  of  the  space  and  time  coordinates  and  where  the  internal  phase 
angles  of  the  space  and  time  coordinates  are  now  the  dynamical  variables  of 
motion.  Expressions  for  the  velocity  and  acceleration  of  particles  undergoing 
inteimal  spacetime  motions  have  already  appeared  in  the  literature.*’®  This 
paper  introduces  the  concepts  of  slow  and  ultrafast  mechanical  processes.  A  slow 
mechanical  process  occurs  when  the  magnitude  of  the  complex  number  potential 
function  varies  while  the  phase  angle  of  the  potential  remains  constant.  The 
case  of  an  ultrafast  mechanical  process  exists  when  the  complex  number  potential 
function  rotates  in  an  internal  space  with  the  magnitude  of  the  potential  held 
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fixed  while  the  phase  angle  of  the  potential  changes. 


The  space  and  time  coordinates  must  be  written  as  complex  numbers  in  in¬ 
ternal  space  as  fellows'*’^ 

V  -  V  exp(je^)  (1) 

t  *  t  exp(j6^)  (2) 

where  v  =  x  ,  y  ,  z  for  cartesian  coordinates;  r  ,  (J)  ,  z  for  cylindrical  polar  co¬ 
ordinates;  and  p  ,  <j)  ,  for  spherical  polar  coordinates.  Strictly  speaking  the 
internal  phase  angle  of  time  6^  is  associated  with  each  space  coordinate  so  that 
for  the  three  elanentary  coordinate  systems  the  internal  phase  angles  of  time 


0*  .  ey  .  0^ 

t  t  *  t 


ftp  ft*** 

®t  ’  ®t  *  ®t 


The  following  quantities  often  appear  in  the  calculations  involving  partially 
coherent  broken  symmetry  space  and  time***® 

tan  0  =  v30  /3v  (4) 

vv  V 

tan  6^  =  t30^/3t  (5) 

tt  t 

as  for  example  in  the  following  differentials'*’® 

dv  =«  sec  6^^  dv  exp[j(0^  +  3^^)1  (6) 

=  CSC  B  vd0  exp[j(0  +  B  )1 
vv  V  V  vv 

dt  =  sec  B^^  dt  exp[j(0^  +  3^^)]  (7) 

=  CSC  B^^  td0^  exp[j(0^  +  3^^.)] 

For  Incoherent  spacetime  Bvv  “  0  snd  Btt  ®  ^  while  for  coherent  spacetime 
3vv  =  ''r/2  and  B^t  =  1^/2  .  The  measured  coordinates  of  space  and  time  are 
given  by**’® 


V  =  V  cos  0 
m  V 


t'^  =  t  cos  Q't 
m  t 


The  complex  number  coordinate  speed  is  obtained  from  equations  (6)  and  (7)  as 

V,,  =  V  exp(j0  )  =  dv/dt  (9] 

V  V  vv 


where 
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(10) 


-  cos  6^^  sec  dv/dt 


-  cos  6^^  CSC  Byy  V  de^/dt 

*  sin  sec  B  t  ^  dv/dB^ 
tt  vv  t 

■  sin  B^^  CSC  B  v/t  d6  /de^ 
tt  vv  V  t 

where 


e  =  e  +  B 

vv  V  vv 


(11) 

(12) 

(13) 

(lA) 


and  where  v  =  x,y,z;r,4'»z;orp,^,i(t  .  The  particle  velocity,  momentum 
and  energy  are  written  as  complex  numbers  in  internal  space  as  follows***® 


V  exp(ie  ) 

V  w 


Pv  =  Pv  ®*P<J®pv^ 


E  -  E  expCjGg) 


and  the.  corresponding  measured  quantities  are 


V  =  V  cos  6 
vm  V  vv 


p  =  p  cos  e 
vm  V  pv 


E  =  E  cos  e_ 
m  E 


which  are  the  real  values  of  the  complex  number  quantities. 

The  basic  interest  in  the  realization  of  complex  number  spacetime  by  the 
application  of  external  fields  is  the  possibility  of  internal  motions  in  space- 
time  which  may  perhaps  eventually  be  used  for  developing  power  sources  such  as 
rocket  engines.  The  essential  outline  of  this  paper  is  as  follows:  Section  2 
outlines  the  subject  of  kinematics  in  spacetime  with  broken  internal  symmetries, 
and  Section  3  considers  Newton's  law  of  motion  for  slow  and  ultrafast  mechanical 
processes  in  asymmetric  spacetime. 

2.  KINEMATICS  IN  SPACE  AND  TIME  WITH  BROKEN  INTERNAL  SYMMETRIES.  This 
section  develops  the  basic  expressions  for  particle  speed  and  acceleration  in 
spacetime  with  broken  internal  symmetries.  The  concept  of  motion  in  totally 
coherent  spacetime  is  introduced.  The  connections  are  made  between  the  mea¬ 
sured  kinematical  quantities  of  partially  coherent  spacetime  and  the  conven¬ 
tionally  calculated  kinematical  quantities  of  incoherent  spacetime. 

A.  Particle  Speed. 

The  speed  of  a  particle  in  broken  symmetry  spacetime  Is  obtained  from 
equations  (9)  through  (14)  as***® 

V  =  V  exp(j6  )  =  dx/dt  (17) 

X  X  vx 


where 
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and 


vx 


cos 

6* 

•^tt 

sec 

®xx 

dx/dt 

cos 

CSC 

X  d0  /dt 

tt 

XX 

X 

sin 

sec 

8 

t~^  dx/d0* 

tt 

XX 

t 

sin 

CSC 

x/t  d0  /d0^ 

tt 

XX 

X  1 

0 

+  S 

<  ■ 

■  e"". 

X 

XX 

t 

tt 

(18) 

(19) 

(20) 
(21) 

(22) 


where  3xx  ^tt  given  by  equations  (4)  and  (5)  respectively.  The  mea¬ 
sured  particle  speed  Is  given  by  equations  (16)  and  (18)  through  (21)  as^ 


V  »  V  cos  0 
me  X  vx 


The  single  particle  momentum  Is  then  given  by 


p  “  p  exp(i6  ) 
*^x  *^x  px 


p  =  mv 

*^X  X 


0  =0 
px  vx 


(23) 


(24) 


Equations  similar  to  ,(17)  through  (22)  can  be  developed  for  the  y  and  z  coor¬ 
dinates. 

The  special  forms  of  the  particle  speed  for  the  four  possible  spacetime 
states  will  now  be  presented. 


Case  a.  Incoherent  Space  and  Incoherent  Time. 


0=0 

X 


6  =  0 
XX 


-  0 


Combining  equations  (18)  and  (25)  gives 


v^^  =  dx/dt 
X 


0^^  =  0 
vx 


(25) 


(26) 


which  is  the  conventional  description. 

Case  b.  Coherent  Space  and  Incoaerent  Time. 


8  =  Tr/2 

XX 


e*.o 


6*  =  0 
tt 


Combining  equations  (19)  and  (27)  gives 


v^^  =  xd6  /dt 

X  X 


9*^^  =  e  +  7r/2 
vx  X 


(27) 


(28) 


which  describe  internal  space  motion  In  external  time. 
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Case  c.  Incoherent  Space  and  Coherent  Tine. 


e  -  0  e  -  0  6*  -  ir/2 

X  XX  tt 


Combining  equations  (20)  and  (29)  gives 


ic  /joX 

V  “  t  dx/d6 
X  t 


»  -  e*  -  it/2 

vx  t 


which  describes  an  internal  time  motion  in  external  space. 
Case  d.  Coherent  Space  and  Coherent  Time. 


B  =  Tr/2 

XX 


6  =  ir/2 

tt 


Combining  equations  (21)  and  (31)  gives 


v*^  =  x/t  de  /d0^ 
X  x  t 


0^  =0  -  0* 
vx  X  t 


which  describes  internal  motion  in  both  space  and  time. 

B.  Particle  Acceleration. 

For  a  broken  symmetry  spacetime  the  particle  acceleration  is  given 
equations  (17)  through  (22)  as**'® 

a^  =  a^  exp(je^)  =  dv^/dt  =  d^x/dt^ 

where® 

a  =  sec  8  cos  8. .  dv/dt 
X  vxvx  tt  X 

=  CSC  8  cos  8*  V  d0  ^/dt 
vxvx  tt  X  vx 

=  sec  8  sin  8*  t  ^  dv  /d0* 
vxvx  tt  X  t 

=  CSC  8  sin  8*  v  /t  d0  /d0* 
vxvx  tt  X  vx  t 


0  =0  +8  -  e*  -  8* 

ax  vx  vxvx  t  tt 


where' 


tan  8  =  V  30  _/3v 

vxvx  X  vx  X 


From  Newton's  law  of  motion  written  as 
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F  ■  F  exp(j6_  )  ■  mi 

X  X  Fx  X 


it  follows  that 


0^-0 
Fx  ax 


The  measured  acceleration  Is  given  by 


a  cos  0 
X  ax 


The  measured  force  Is  given  by  equations  (40)  and  (41)  as 


F  cos  0_ 
x  Fx 


Coniblnlng  equations  (41)  through  (43)  gives 


and  therefore  the  measured  acceleration  Is  determined  from  the  measured  force 
by  Newton's  law  of  motion.  The  values  of  the  acceleration  magnitude  ax  and  ac¬ 
celeration  Internal  phase  angle  0ax  will  now  be  calculated  for  several  caS'^  of 
the  broken  symmetry  of  space  and  time. 

Case  a.  Incoherent  Space  and  Incoherent  Time. 

A  general  expression  for  the  acceleration  Is  developed  that  can  be  used 
to  deduce  the  limiting  case  of  Incoherent  space  and  Incoherent  time  which  Is 
described  by 


0=0 

X 


B  =  0 

XX 


0^  =  0 


The  appropriate  expressions  for  the  acceleration  magnitude  and  Internal  phase 
angle  are  deduced  from  equations  (34)  and  (38)  to  be^ 

a  =  sec  B  cos  B*^  dv  /dt  (46) 

X  vxvx  tt  X 

0  =0  +  B  -  e*  -  (47) 

ax  vx  vxvx  t  tt 


where 


tan  B  =  V  30  /3v 

vxvx  X  vx  X 


Combining  equations  (18)  and  (46)  gives® 

a  =  cos  B*  sec  B  d/dt(cos  8*  sec  B_  dx/dt) 

X  tt  vxvx  tt  XX 


An  alternative  expression  for  the  acceleration  obtained  from  equations  (18) 
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and  (35)  as 


*x  ■  *vxvx  “•  ®tt  'x  "vx/*"'  <5®> 

“  ®vxvx  “*  ®tt  ®tt  ““  ®xx 

■  ®vxvx  »“  ®xx 

Combining  equations  (22)  and  (47)  gives  the  phase  angle  of  the  acceleration  as 


e  «e+3  +3  -  2(6  +  3  ) 

ax  X  '^xx  vxvx  '  t  tt 


(51) 


When  the  conditions  In  equation  (45)  are  valid  the  epxresslon  In  equation  (49) 
reduces  to  the  standard  case  of  Incoherent  space  and  Incoherent  time. 

Case  b.  Coherent  Space  and  Incoherent  Time. 

General  expressions  for  the  acceleration  are  now  deduced  which  can  be  used 
to  make  the  transition  to  the  case  of  coherent  space  and  Incoherent  time  which 
Is  desclrbed  by 


3, 


XX 


ir/2 


9*.0 


3' 


tt 


(52) 


An  appropriate  general  expression  for  the  acceleration  magnitude  and  Internal 
phase  angle  for  this  case  Is  obtained  from  equations  (19)  and  (34)  to  be^ 

a  *  sec  3  cos  3*  dv  /dt  (53) 

X  vxvx  tt  X 

*  ®vxvx  ^t  ^tt  ^xx  * 

An  alternative  form  of  the  acceleration  Is  obtained  from  equation  (35)  as 

a  “  CSC  3  cos  3*^  v  de  Idt  (54) 

X  vxvx  tt  X  vx 

where  v-  is  given  by  equation  (19).  Combining  equations  (19)  and  (54)  gives 


2  X 

a  =  CSC  3  cos  3^^  esc  3  x  de  /dt  d0  /dt 
X  vxvx  tt  XX  X  vx 

where  from  equation  (22) 

de  /dt  =  d/dt(e  +3  -  0*  -  e*  ) 

vx  '  X  XX  t  tt 

The  acceleration  phase  angle  Is  obtained  from  equation  (51)  as 
e  =6+3  +3  -  2(0*  +  3*  ) 

ax  X  XX  vxvx  t  tt 


(55) 


(56) 


(57) 
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For  the  case  at  hand  it  is  convenient  to  write  the  acceleration  In  equation 
(33)  for  negative  acceleration  (attractive  forces)  as 

*x  "  *x  **P<J®ax^  “  ®x 
■  dv  /dt  ■  d^/dt^ 

X 

where 


t 

1 

X 


0 

ax 

so  that 

t 


6 

ax 


IT 


-  CSC  0 


vxvx 


“  CSC  6 


vxvx 


cos  de^/dt 

cos^B*,  CSC  X  do  /dt  d6  /dt 

tt  XX  X  vx 


(59) 

(60) 


(61) 

(62) 


ax 


0+6  +6  -  2(0*  +  6*  )  -  It 

X  ^xx  vxvx  '  t  tt 


(63) 


which  Is  an  equivalent  description  of  the  acceleration. 


For  the  special  case  of  coherent  space  and  Incoherent  time,  equations 
(52) ,  (56) ,  (62)  and  (63)  give 


a  =  -  CSC  6  x(d0  /dt) 
x  vxvx  X 


0*^^'’’  =  0  +  6*^'^  -  ti/2 

ax  X  vxvx 


(64) 

(65) 


where  from  equations  (19),  (22),  (48)  and  (52)  It  follows  that 
cl 


V  =  X  d6  /dt 
X  X 


0*^^  =  0  +  ir/2 

vx  X 


tan 

vxvx  xt  xt 


(66) 

(67) 

(68) 


where 


=  (d0  /dt)^  >  0 

xt  X  xt 

-  d^0  /dt^  <  0 

xt  X  xt 

Equation  (68)  then  gives 


(69) 

(70) 
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(71) 


«<=  +  (O'’*'" 

Because  fS^  <  0  it  follows  from  equation  (68)  that 


■^xt 

jCi 

vxvx 


w/2  +  ic 


xt 


where  K^t  ^  0  is  a  small  number  which  Is  also  given  by 


tan  f  ^ 

xt  '  xt'  xt 


Combining  equations  (64) «  (65),  (71)  and  (72)  gives 
ci+ 


a 


6 


X 

cit 

ax 


-  x[(E‘^b^  +  (F^J)^]^^^ 

'  xt  xt 


e  +  K  ^ 

X  xt 


(72) 

(73) 

(74) 

(75) 


The  angle  Is  generally  a  small  number.  Clearly  the  acceleration  In  this 
case  Is  directed  opposite  to  the  displacement  x  .  Finally,  from  equations  (58) 
and  (64)  through  (75)  it  follows  that  for  a  negative  acceleration  (attractive 
forces) 


X  '  xt  '*  xt 


(76) 


The  positive  acceleration  of  a  particle  moving  under  the  Influence  of  a 
repulsive  force  In  coherent  space  and  coherent  time  Is  given  by 


-  iCE'f  -  JF'i*) 

X  xt  ■'  xt 

X  xt  xt 


with 


ax 


xt 

pCif 

xt 


tan  3 


e  +  -  ir/2 

X  vxvx 


xt 

xt 


ci+ 

vxvx 


>  0 
xt 

<  0 
xt 


ci+/  ci+ 
xt  xt 


vxvx 


vxvx 


Tr/2  +  tc 


xt 


(77) 

(78) 

(79) 

(80) 
(81) 

(82) 

(83) 


so  that 
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0Ci+  ^  gCit  _  0  +  K  . 

ax  ax  X  xt 

-ci+  -cl  -/-ci  4»ci. 

a  -  -  a  ■  x(E  ^  -  jF 
X  X  '  xt  xt 


Case  c.  Incoherent  Space  and  Coherent  Time. 


An  expression  for  the  acceleration  Is  now  derived  from  which  the  limiting 
case  of  Incoherent  space  and  coherent  time  can  be  obtained.  This  limiting  case 
Is  described  by 


g  -  0 
XX 


8tt  - 


The  required  general  expression  for  the  acceleration  magnitude  and  acceleration 
phase  angle  Is  obtained  from  equations  (36)  and  (51)  as^ 

a  -  sec  6  sin  6*  t“^  dv  /de*  (87) 

X  VXVX  tt  X  t  ' 

0  -0+6  +6  -  2(0*  +  6*  )  (88) 
ax  X  VXVX  XX  ^  t  tt'  ' 

Combining  equations  (20)  and  (87)  gives 

a^  -  sec  3^^  sin  6*^  t“^  d/d0*(sln  6*^  sec  3^^  t  ^  dx/d0*)  (89) 

From  equations  (20)  and  (37)  It  follows  that 

a  =  CSC  $  sin  0*  v  /t  d0  /d0*  (90) 

X  VXVX  tt  X  vx  t 

=  CSC  0  sin^0*.  sec  0  t  ^  dx/d0*  d0  /d0*  (91) 

VXVX  tt  XX  t  vx  t 

For  this  case  it  is  convenient  to  rewrite  the  acceleration  equation  (33)  in  the 
following  form 

®x  "  ®x  ®*P^j®ax^  “  ^x  “  dv^/dt  =  d^x/dt^  (92) 

where 

a'  =  -  a  (93) 

X  x 

0'  =0  +  TT  (94) 

ax  ax 


so  that  an  equivalent  representation  of  the  acceleration  that  is  useful  for  at¬ 
tractive  forces  and  negative  accelerations  is  obtained  from  equations  (93)  and 
(94)  and  (87)  through  (89)  as 


a'  =  -  sec  0  sin  0*  t  ^  dv  /d0* 

X  VXVX  tt  X  t 


-  sec  sin  0*^  t  ^  d/d0*(sin  0*^  sec  0^  t  ^  dx/d0*)  (96) 
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(97) 


0'-8  +B  +6  -  2(6*  +  e*  )  +  « 

ax  ax  XX  ‘^vxvx  '  t  tt' 


where  x  and  6*  are  variables. 

In  the  case  of  lnc<^erent  space  and  coherent  tine,  equations  (86),  (96) 
and  (97)  give 

ic’ 
a 
x 

.ic’ 


flic  -2  .2 

-  sec  8  t  d  x/d0^ 
vxvx  t 


0"''  -  6^*^  -  20* 
ax  vxvx  t 


(98) 

(99) 


where  0^'  is  a  small  number,  and  where  from  equations  (20),  (22),  (48)  and 
(86)  It  follows  that 


Ic 


t  ^  dx/d0* 


0^^  «  -  e*  -  ii/2 
vx  t 

tan 

vxvx  xt  xt 


where 


-  -  dx/d0* 

xt  t 

-  d^x/d0*^ 
xt  «  t 


>  0 
xt 

F^^  >  0 

xt 


(100) 

(101) 

(102) 

(103) 

(104) 


sec  8^*^  -  (F^^)‘h(E^®)^  +  (F^S^l^^^ 

vxvx  xt  xt'  xt 


(105) 


Because  >  0  and  F^^  >  0  it  follows  that  8j^yjj  is  a  small  positive  number 
for  this  case.  Equations  (98)  and  (105)  give 


(106) 


Finally  from  equations  (92)  and  (98)  through  (106)  it  follows  that  for  an  at¬ 
tractive  force  with  a  negative  acceleration 


(107) 


For  the  case  of  a  positive  acceleration  due  to  a  repulsive  force  acting 
in  Incoherent  space  and  coherent  time  the  acceleration  is  given  by 


-ic+ 
a  ” 

X 

(108) 

lc+ 
a  = 

X 

(109) 

ax 

^i^:x  -  2«t 

(110) 

497 


with 


xt  xt 

xt  xt 


tan  6 


lc+ 

vxvx 


ic+/  lc+ 
xt  '  xt 


a 


ic+ 

vxvx 


a. 


Ic 

vxvx 


>0 

(111) 

(112) 

(113) 

(114) 

80  that  for  positive  accelerations 

-  a^*^  -  20* 

ax  ax  vxvx  t 

m  -  m  +  jE^^) 

X  X  '  xt  xt' 


ic+ 


-  a 


ic' 


(115) 

(116) 


Case  d.  Coherent  Space  and  Coherent  Time. 

In  this  section  an  equation  for  the  acceleration  of  a  particle  is  obtain¬ 
ed  which  can  be  used  to  attain  the  limit  of  coherent  space  and  coherent  time 
which  is  defined  by 


a. 


XX 


iv/2 


a‘ 


tt 


it/2 


(117) 


The  general  expression  for  the  magnitude  and  internal  phase  angle  of  the  accel¬ 
eration  is  obtained  from  equations  (37)  and  (51)  as^ 


ax 


CSC  a  sin  a*  V  /t  d0  /d8* 

vxvx  tt  x'  vx  t 


0  +  a  +  a  -  2(9*  +  B*  ) 

X  vxvx  XX  '  t  tt 


(118) 

(119) 


Combining  equations  (21)  and  (118)  gives  the  acceleration  magnitude  as 

a  =  CSC  a  sin^B*  esc  B  x/t^  d6  /dO*  dS  /dO* 

X  vxvx  tt  XX  X  t  vx  t 


(120) 


where  from  equation  (22)  it  follows  that 


de^/de*  .  de^/de;;  -  i  +  d/d6;;(B^  - 


(121) 


Another  expression  for  the  acceleration  magnitude  can  be  obtained  from  equations 
(21)  and  (36)  which  gives 

a^  -  sec  a^^  sin  a*^  t  ^  d/de*(sin  a*^.  esc  a^^^^  x/t  d0^/d0*)  (122) 
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In  the  present  case  it  is  convenient  to  write  the  acceleration  in  equation  (33)  as 


i  -  a  exp(j6  )  ■  a'  exp(J6'  )  “  dv  /dt  ■  d^/dt^ 
X  X  ax'  X  ax  x 


(123) 


where 


a'  ■  -  a 

X  X 

6'  "0  +  If 

ax  ax 


(124) 

(125) 


and  an  alternative  representation  of  the  acceleration  that  is  suitable  for  at¬ 
tractive  forces  is 


a'  «  -  CSC  B  sin^B*  esc  B^  x/t^  d0  /d0*  d0  /d0* 

X  VXVX  tt  XX  X  t  vx  t 


(126) 


-  -  sec  B,__  sin  B*  t"^  d/d0*(sin  B^^  esc  B^^  x/t  d0^/d0*)  (127) 

VXVX  tC  C  tt  XX  X  t 


0’  -  0  +  B  -  e*  -  B*  +  It 

ax  vx  VXVX  t  tt 


(128) 


0  +  B  +  -  2(0*  +  B*  )  +  It 

X  VXVX  XX  t  tt 


For  the  case  of  coherent  space  and  coherent  time  equations  (126)  through 
(128)  become  with  the  help  of  equation  (117) 


a'  -  -  CSC  B*^  x/t^  d0  /d0*(d0^/d0*  -  1) 
X  VXVX  .X  t  X  t 


-  sec  B*^  x/t^  d^0  /d0*^ 

VXVX  X  t 


(129) 


(130) 


0  +  B*^  -  20*  +  ft/2 

X  VXVX  t 


(131) 


From  equations  (21)  and  (22)  it  follows  that 


v*^  *  x/t  d6  /d0* 

X  X  t 


(132) 


0-0* 
X  t 


(133) 


Equations  (48),  (132)  and  (133)  give 
tan  B*^  =  /F^^. 

VXVX  xt  xt 


(134) 


where  for  coherent  spacetime 

^xt  -  -  1) 

-  d^e  /de*^ 

Xt  X  t 


<  0 
xt 


F*^  >  0 
xt 


(135) 


(136) 


499 


(137) 


CSC  e®  -  (E*^  )"^[(E%)^  + 

VXVX  Xt'  xt  '  xt  ' 

sec 

VXVX  xt'  ‘ '  xt  xt 


(138) 


where  d6j^/d6jE  <  1  .  In  a  gravitational  field,  for  example,  the  following  rela¬ 
tionship  holds'^ 


26*  -v.  36 

t  X 


d6  /d6*  'v.  2/3 
X  t 


(139) 


Therefore  In  general  <  0  and  nt>0  ,  and  it  Is  convenient  to  Introduce  a 
new  angle  by  writing 


6  -  -  ir/2  +  6  . 

VXVX  xt 


(140) 


or  equivalently 


(141) 


so  that  In  general  5,^^  >  0  .  Cond)lnlng  equations  (129)  through  (131)  and  (135) 
through  (138)  gives  for  coherent  space  and  time 


(142) 


6^  =  6^^  +  -  6*  +  ir/2 

ax  vx  VXVX  t 


(143) 


6*^  -  6*  +  6  ^ 
vx  t  Xt 


=  6  +  B^^  -  26*  +  ir/2 

X  VXVX  t 


=  6  -  26"  +  6  ^ 
X  t  xt 


Therefore  6^  Is  a  small  number.  Coherent  space  and  time  represents  an  Inter¬ 
nal  motion  in  space  and  time  that  can  be  written  in  complex  number  form  as 


dx  =  jxd6 


dt  =  jtde* 


(144) 


which  is  equivalent  to  equation  (117).  The  magnitude  and  phase  angle  equations 
(132)  and  (133)  are  equivalent  to  the  following  complex  number  expression  for 
the  particle  speed  in  coherent  spacetime 


v*^  «  (dx/dt)^  =  x/t  de  /d8* 

X  X  c 


(145) 


The  magnitude  and  phase  angle  of  the  acceleration  that  are  given  in  equations 
(142)  and  (143)  correspond  to  a  complex  number  acceleration  that  is  obtained 
from  equation  (123)  and  Is  given  by 
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(146) 


5®  -  (d  x/dt^)*^  -  x/t^[d6^/de*(d6^/de*  -  1)  -  jd^e^/de*^l 
-  ’  ^ 

as  it  must  be  becuase  of  the  definition  of  5^^  given  in  equations  (140)  and  (141). 

For  a  repulsive  force  acting  on  a  particle  in  coherent  space  and  coherent 
time  the  positive  acceleration  is  given  by 


*r  -  c + iO 


(147) 


(148) 


(149) 


with 


T.C+  _c 
E  .  =  E  ^ 
xt  xt 


E*^"^  <  0 
xt 


(150) 


xt  xt 


>  0 
xt 


(151) 


tan  s'*  -  E‘=t/F‘'t 

vxvx  xt  xt 


(152) 


-  B®  =  -  v/2  +  6  ^ 
vxvx  vxvx  xt 


(153) 


0*^  =  0*^'  -  6  -  26*  +  6  ^  a*^'*'  -  -  a*^' 

ax  ax  X  t  xt  x  x 


(154) 


-  -  a*^  -  x/t^(-  E*^^  +  jF^^) 
X  X  '  xt  •*  xt' 


(155) 


which  agrees  with  equation  (146). 


3.  SLOW  AND  ULTRAFAST  FORMS  OF  NEWTON'S  LAW  OF  MOTION  IN  BROKEN  SYMMETRY 


SPACETIME .  Newton  s  law  of  motion  for  a  particle  in  a  potential  field  can  be 
written  for  spacetime  with  broken  internal  symmetries  as  follows^“^° 


mi  =  md^x/dt^  =  -  9W/3x  =  F 

X  X 


(156) 


where  m  =  particle  mass  and  W  =  complex  number  potential  which  can  be  written  as 


W  =  W  exp(je^) 


(157) 


The  derivatives  of  the  potential  function  can  be  written  in  four  ways.  Repre¬ 
senting  the  complex  number  force  in  the  following  way 
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(158) 


F  -  F  exp(je_  )  -  -  3W/3X 

gives* 

F^  -  -  cos  8^  sec  6^  3W/3x 

-  -  cos  6^  CSC  8^  W  30y/3x 

■  -  'i”  =«  6ww  '■* 

-  -  sin  8^  CSC  6^  W/x  3e,,/38^ 

and 

®Fx  "  ®W  ■*■  ®VW  “  ®x  ■  ^xx 
where  6^,^  Is  a  small  angle,  and  where 
tan  8^^  -  W30y/3W 


(159) 

(160) 
(161) 
(162) 

(163) 

(164) 


For  repulr.lve  forces  with  F^  >  0  the  phase  angle  and  magnitude  equations  equi¬ 
valent  to  equation  (156)  are  03^^  =  0px  and  max  “  attractive  forces 

the  phase  angle  and  magnitude  equations  that  are  equivalent  to  Newton's  law  in 
equation  (156)  for  nearly  Incoherent  space  and  nearly  incoherent  time  are  ob¬ 
tained  from  equations  (51)  and  (163)  for  Fx  <  0  and  ax  <  0  as 


0 

ax 


(165) 


For  nearly  coherent  space  and  nearly  incoherent  time,  equations  (59),  (60)  and 
(163)  give  for  Fx  <  0  and  ax  >  0 


0  =0^ 
ax  Fx 


(166) 


For  nearly  incoherent  space  and  nearly  coherent  time,  equations  (93),  (94)  and 
(163)  give  for  Fx  <  0  and  ax  >  0 


0' 

ax 


-  ma  =  F 

X  X 


(167) 


For  the  case  of  nearly  coherent  space  and  nearly  cohere  it  time  it  follows  from 
equations  (124),  (125)  and  (163)  that  for  Fx  <  0  and  ax  >  0 


0' 

ax 


(168) 


Newton's  law  of  motion  given  by  equation  (156)  will  now  be  considered  for  the 
four  kinematic  spacetime  conditions  that  were  considered  in  Section  2  and  for 
slow  and  ultrafast  mechanical  processes,  which  produces  eight  possible  cases 
for  Newton's  law  of  motion. 
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Case  1.  Slow  Process,  Incoherent  Space  and  Incoherent  Time. 

This  section  develops  the  Newtonian  law  of  dynamics  for  broken  symmetry 
spacetime  In  a  form  that  Is  suitable  to  make  the  transition  to  the  case  of  a 
slow  mechanical  process  In  Incoherent  space  and  Incoherent  time  which  Is  de¬ 
scribed  by 


e  «  0 

W 


0=0 

X 


e  =  0 

XX 


St  “  ° 


3*  “  0 

^tt 


(169) 

(170) 


where  x  and  t  are  variables.  Combining  equations  (40),  (46),  (47),  (156)  (159) 
and  (163)  gives  Newton's  law  as 


m  cos  B  sec  3  dv  /dt  =  -  cos  3  sec  3,„,  3W/3x 
tt  vxvx  X  XX  WW 


0  =0  +3  -0^-3* 

ax  vx  vxvx  *■ 


tt 


(171) 

(172) 


=  +  Pin,  -  0  -  6 

W  WW  X  XX 

where  3vxvx  ^WW  given  by  equations  (48)  and  (164)  respectively.  Com¬ 
bining  equations  (18)  and  (171)  gives  Newton's  law  of  motion  as 


m  cos  3*  sec  3  d/dt(cos  3*  sec  3  dx/dt) 
tt  vxvx  tt  XX 


(173) 


-  cos  sec  3yy  3W/3x 


Combining  equations  (50)  and  (159)  gives  an  alternative  form  of  Newton's  law 


2  X 

m  CSC  3  cos  3^..  sec  3  dx/dt  d0  /dt 
vxvx  tt  XX  vx 


(174) 


-  cos  3  sec  3,w,  3W/3x 
XX  WW 


Combining  equations  (51)  and  (172)  gives 


0  =0+3  +3  -  2(0*  +  3*  ) 

ax  X  XX  vxvx  t  tt 


(175) 


"  ®W  ®WW  "  ®x  "  ®xx 


-a 


If  the  potential  function  is  given  by  a  power  law  W  'v-  x  then  equation  (164) 
gives 


^WW  ~  ^xx 


(176) 


Therefore  a  reasonable  approximation  to  equation  (175)  for  many  potential  func¬ 
tions  is  given  by 
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(177) 


26  +  e  +6  -  2(6*  +  6*  )  6u 

X  XX  vxvx  '  t  tt  W 


The  limiting  case  of  a  slow  mechanical  process  in  Incoherent  space  and  Incoherent 
time  Is  obtained  by  setting  all  phase  angles  equal  to  zero  and  the  exact  equation 
(173)  becomes 


m  d^x/dt^  ■■  -  dW/9x 


(178) 


which  Is  the  standard  form  of  Newton's  law  of  motion. 

Case  2.  Slow  Process,  Coherent  Space  and  Incoherent  Time. 

This  section  gives  a  form  of  Newton's  law  of  motion  that  can  be  used  to 
regain  the  special  case  of  a  slow  dynamical  process  In  coherent  space  and  Inco¬ 
herent  time  which  Is  described  by 


e„  =  o 


(179) 


6  =  ir/2 

XX 


e-.o 


(180) 


where  and  t  are  variables.  Combining  equations  (19),  (53),  (59)  and  (161) 
gives  Newton's  law  of  motion  for  attractive  forces  as 

-  m  sec  6  cos  6*  dv  /dt  «  -  sin  8  sec  8,„,  x~^  9W/9e  (181) 

vxvx  tt  x  XX  WW  X 


m  sec  8  cos  8*  d/dt(cos  8*^  esc  8  x  d0  /dt) 

vxvx  tt  tt  XX  X 


(182) 


=  sin  8  sec  8,„,  x  9W/90 
XX  WW  X 

with  9W/90JJ  ^  0  .  Combining  equations  (62)  and  (161)  gives  for  attractive 
forces  Newton's  law  as 


m  CSC  8  cos  8  esc  8  x  d6  /dt  d0  /dt 
vxvx  tt  XX  X  vx 


(183) 


=  sin  8  sec  8,„,  x  9W/90 
XX  WW  X 

where  the  general  expression  for  8vxvx  given  by  equation  (48)  and  d0vx/dt 
is  given  by  equation  (56) .  The  phase  angle  equation  for  this  general  case  is 
given  by  equations  (63)  and  (163)  as 


0+8  +8  -  2(0*  +  8*  )  -  n  =  0„  +  8.„,  -  0  -  8 

X  XX  vxvx  t  tt  W  nJW  X  a 


(184) 


which  is  valid  for  attractive  forces. 


The  limiting  case  of  a  slow  dynamical  process  in  coherent  space  and  in¬ 
coherent  time  is  obtained  from  equations  (180),  (182)  and  (183)  which  give  for 
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attractive  forces 


gel  2  2  —1 

®vxvx  ^  ®MW  *  SM/aSjj  (185A) 

sd  2  —1 

®“  ‘^®*^  ®VJCVX  ‘  ®*‘^  ®WW 


Combining  equations  (68)  through  (71)  with  equation  (185)  gives  Newton’s  law 
for  attractive  forces  as 


«>«[(E®J^)^  + 


(p8Cij2jl/2 


®ec  X 


-1 


aw/ae 


(186) 


where  for  attractive  forces  aw/ao^  ^  0  ,  and 
forms  as  in  equations  (69)  and  (70)  but  with 


-  d^e  /dt^  >  0 

Xt  X  xt 


where  E^t  and  Fxt  have  the  same 

the  sign  of  F®f ^  reversed 
xt 


(187) 

(188) 


Ihe  case  of  a  slow  process  in  coherent  space  and  Incoherent  time  is  obtained 
by  combining  equations  (179)  and  (186)  which  gives  for  attractive  forces  the 
following  form  of  Newton’s  law 


a(E®f  )^  + 


x~' aw/ae 


(189) 


with  aw/aSx  ^  0  .  me  phase  angle  equation  for  Neaton’s  law  of  motion  In  co¬ 
herent  space  and  Incoherent  time  Is  obtained  from  equations  (63) «  (163)  and 
(166)  to  be  for  slow  attractive  forces  with  0y  ■  0 


ax 


0 

X 


+  6 


scl 

vxvx 


-  Tr/2 


(190) 


-  0  -  ir/2 

X 

iSCl 


where  for  this  case  3^x^  given  by  equation  (68) 


tan  0®*=^  -  E®‘=Vf®‘^^ 

'^vxvx  xt  '  xt 


(191) 


so  that  0vx^  n  small  positive  number.  Equation  (190)  can  also  be  written  as 

.aci  ,  f\/\  _  f\  n  _  f\  (192) 


0“'-*  +20  »  0 
vxvx  X 


where  for  this  case  0^^  ^  0  and  0x  ^  0  and 


^WW  ^  ®xx 


(193) 


The  equivalent  complex  number  form  of  Newton's  dynamical  law  for  a  slow  mech¬ 
anical  process  in  coherent  space  and  incoherent  time  is 
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-  -  JF®^^)  -  -  3W/3X  -  J/x  3W/3e^ 


(194) 


For  repulsive  forces  equations  (53)  and  (161)  give  Newton's  law  of  mo¬ 
tion  as 


^  X  *  1 

m  sec  3  cos  6^^  dv  /dt  -  -  sin  3  sec  3»-,  x  3W/3e 
vxvx  tt  X  XX  'W  > 


or  equivalently 


(195) 


(196) 


m  sec  3"*"  cos  3*  d/dt(cos  3*  esc  3  x  de  /dt) 
vxvx  tt  tt  XX  X 

=  -  sin  3  sec  3,-1  3W/3e 

XX  WW  X 


Combining  equations  (55)  and  (161)  gives  Newton's  law  of  motion  for  repulsive 
forces  as 


-t-  2  X 

m  CSC  3  cos  3^^  esc  3  x  de  /dt  de  /dt 
vxvx  tt  XX  X  vx 

«  -  sin  3  sec  3,„,  x“^  3W/3e 
XX  WW  x 


(197) 


with  3W/3ex  ^  0  .  For  repulsive  forces  equations  (57)  and  (163)  give 
e  +  3  +3'*’  -  2(e*^  +  3*  )  -  e„  +  3,„,  -  e  -  3  +  ir 

X  XX  vxvx  '  t  tt'  w  WW  X  XX 


(198) 


For  the  limiting  case  of  a  slow  mechanical  process  that  occurs  In  coherent 
space  and  Incoherent  time  equations  (196)  and  (197)  for  repulsive  forces  give 
the  following  form  of  Newton's  law  of  motion 


«!cl+  2  2  -1 

mx  sec  3  d^e  /dt  =  -  x  ^3W/3e 
vxvx  X  X 

scl+  2  -1 

mx  CSC  3  (de  /dt)  =  -  x  3W/3e 
vxvx  X  > 


where 


tan 

VXVX  xt  xt 


,scl+  „scl 


xt 

scl+ 

xt 


=  E 


=  F 


xt 

scl 

xt 


(de^/dt)^ 
d^e  /dt^ 

X 


>  0 
xt 


F®^^  >  0 
xt 


scl+  ^  scl 
vxvx  vxvx 

and  equation  (200)  gives  Newton's  law  of  motion  for  a  repulsive  force  as 


(199A) 

(199B) 

(200) 

(201) 

(202) 

(203) 
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-  -  x'^3W/30^ 


(204) 


with  3W/36^  ^  0  ,  and  equations  (198)  and  (194)  give  Newton's  law  of  notion  for 
repulsive  forces  in  coherent  space  and  Incoherent  time  as 


20  +  6®^^ 

X  vxvx 


0 

,sci+v 


0, 


W 


iia(E®^^‘^  -  ”  -  3W/3i  -  j/x  3W/30^ 


(205) 

(206) 


Equivalently  Newton's  law  of  motion  in  equation  (206)  for  repulsive  forces  Is 
written  as 


”“^^xt  ■  J  xt  ^  aw/ 30^ 


(207) 


which  is  valid  for  repulsive  forces  acting  in  coherent  space  and  incoherent 
time. 


Case  3.  Slow  Process,  Incoherent  Space  and  Coherent  Time. 

In  this  section  a  form  of  Newton's  dynamical  law  Is  developed  for  broken 
symmetry  spacetime  that  can  be  used  to  attain  the  case  of  a  slow  mechaxkical 
process  in  Incoherent  space  and  coherent  time  which  Is  described  by 

®W  “  °  ^WW  “  0  ^208) 

®x  “  °  ^xx  “  °  ® ‘ 

where  x  and  0^  are  variables.  Combining  equations  (95)  and  (159)  gives  Newton ' s 
law  for  an  attractive  force  as 

-  m  sec  6  sin  6*  t  ^  dv  /d8*  *  -  cos  B  sec  6,_,  3W/3x  (210) 

vxvx  tt  X  t  XX 

From  equations  (96)  and  (159)  It  follows  that  equation  (210)  can  be  written  as 

m  sec  3  sin  3*  t  ^  d/d0*(sin  3*  sec  3  t  ^  dx/d6*)  (211) 

vxvx  tt  t  tt  XX  t 

=  cos  3  sec  3,„,  3W/Sx 

XX  WW 

Combining  equations  (90),  (93)  and  (159)  gives  Newton's  dynamical  law  for 
attractive  forces  as 

-  m  CSC  3  sin  3^^  v  /t  d0  /d8*  =  -  cos  3  sec  B.-,  3W/9x  (212) 

vxvx  tt  X  vx  t  XX  WW 

or  equivalently 

m  CSC  3  sin^B*  sec  3  t  ^  dx/dS^  d6  ^/dS^  (213) 

vxvx  tt  XX  t  vx  t 

=  cos  3  sec  3,„,  3W/3x 
XX  WW 
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Equations  (210)  through  (213)  are  valid  for  attractive  forces  for  uhlch 
dU/9x  >  0  .  The  phase  angle  condition  for  Newton's  law  of  motion  for  this 
case  is  obtained  from  equations  (97)  and  (163)  as 

0  +  B  +  B  -  2(0*  +  B*^)  +  ir  -  0,,  +  B,-,  -  0  -  B  (214) 

X  XX  vxvx  '  t  tt'  W  WW  X  XX 

which  is  valid  for  attractive  forces. 


For  the  limiting  case  of  a  slowly  changing  potential  in  Incoherent  space 
and  coherent  time  Newton's  dynamical  equations  (211)  and  (213)  for  attractive 
forces  become 


mt  ^  sec  B*^^  d^x/d0*^  =  3W/3x 

vxvx  t 

mt  ^  CSC  (-  dx/d8^)  =  3W/3x 

vxvx  t 


(215) 


(216) 


For  this  case  Newton's  dynamical  laws  (211)  and  (213)  can  also  be  written  for 
attractive  forces  as 


(217) 


with  3U/3x  >  0  for  an  attractive  force,  and  where 
tan 

VXVX  xt  xt 


(218) 


=  -  dx/d0* 
xt  xt  t 

=  F^^  =  d^x/de*^ 
xt  xt  t 


>  0 
xt 

F®i^  >  0 

xt 


(219) 


(220) 


where  F^t  given  by  equations  (103)  and  (104).  It  is  clear  that 


B®^‘'  = 

VXVX  vxvx 


(221) 


where  is  a  small  number  given  by  equation  (102) .  The  corresponding  phase 

angle  condition  obtained  from  equation  (214)  is 


-  29*  -  0 

VXVX  t 


e„  =  o 


(222) 


The  equivalent  complex  number  form  of  Newton's  law  of  motion  is  written  for 
attractive  forces  as 


-  m/t^(F®J‘^  +  jE®^*^)  =  -  3W/3x  =  -  3W/3x 


(223) 


which  holds  for  slow  attractive  forces  in  incoherent  space  and  coherent  time. 
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In  the  case  of  repulsive  forces  Newton’s  dynamical  equations  (87)  and 
(159)  give 

®  Kxvx  ®tt  ®xx  ®®‘^  ®WW  <^24) 

while  equations  (89)  and  (159)  give  Newton’s  law  for  repulsive  forces  as 

m  sec  8^^  sin  t“^  d/d0*(sln  B*^  sec  B^  t"^  dx/dB*)  (225) 

“  “  ‘^°®  ®xx  ®®^  ^WW 


Combining  equations  (91)  and  (159)  gives  Newton's  dynamical  law  for  repulsive 
forces  as 


m  CSC  B"*^  sin^B*  sec  B  t  ^  dx/d0*  d0  /d0* 

VXVX  tt  XX  t  vx  t 


(226) 


~  ^°®  ®xx  ®®‘^  ^JW 


where  for  a  repulsive  force  3W/3x  <  0  .  For  a  repulsive  force  equations  (88) 
and  (163)  give 


0+6  +  B‘‘‘  -  2(0*  +  0*  >  -e„+B^-0  -  B  -IT 

X  XX  VXVX  t  tt  W  TIB  X  XX 


(227) 


In  the  limiting  case  of  a  slow  mechanical  process  in  incoherent  space  and  co¬ 
herent  time  Newton's  dynamical  equations  (225)  and  (226)  become  for  repulsive 
forces 


mt"^  sec  B®^*^'*’  d^x/d0*^  -  -  3W/3x 

VXVX  t 

mt“^  CSC  6®^*^'*’  (-  dx/d0*)  -  -  3W/3x 
VXVX  '  t 


where 


tan  B 


sic+ 

VXVX 


sic+,  slc+ 
®xt  '  xt 


(228) 

(229) 


(230) 


= 

slc+ 

>  0 

xt 

xt 

xt 

xt 

FSic+ 

>  0 

xt 

= 

VXVX 

xt 

®ic 

'’vxvx 

xt 

xt 

(231) 

(232) 

(233) 


and  Newton's  dynamical  law  for  repulsive  forces  in  equations  (228)  and  (229) 
becomes 
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-  3W/3x 


(234) 


Equivalently,  equation  (234)  can  be  written  as 

mt“^[(E^J)^  +  -  -  3W/3X 


(235) 


with  3W/3x  <  0  £or  a  repulsive  force.  The  phase  angle  equation  (227)  for 
Newton's  dynamical  law  with  repulsive  forces  becomes  with  6y  «  0 


6*^“=  -  26* 
vxvx  t 


0*  >  0 


(236) 


6^*^  -  26* 
vxvx  t 


0*  >  0 


(237) 


Finally,  the  complex  number  form  of  Newton's  law  of  motion  for  a  slow  mechani¬ 
cal  process  In  Incoherent  space  and  coherent  time  and  for  a  repulsive  force  is 
written  in  an  analogous  fashion  to  equation  (223)  as 


m/t^(F®J‘^‘'‘  +  jE®J‘^‘‘‘)  -  -  3W/3x  -  -  3W/3x 


(238) 


Equation  (238)  can  be  written  In  the  following  two  equivalent  forms  of  Newton's 
dynamical  law  of  motion  for  this  special  case 


m/t^(FjJ''  +  jE^^^")  »  -  3W/3X 
m/t^(F^^  +  jE^^)  =  -  3W/3X 


(239) 


(240) 


where  e„  =■  0  ,  and  where  for  a  repulsive  force  3W/3^  <  0  . 

Case  4.  Slow  Process,  Coherent  Space  and  Coherent  Time. 

A  form  of  Newton's  law  is  considered  that  can  be  reduced  to  the  special 
case  of  a  slow  mechanical  process  in  coherent  space  and  coherent  time  which  Is 
described  by 


^ww  =  0 

®xx  = 


e„  =  0 


(241) 

(242) 


where  and  0*  are  variables.  A  form  of  Newton's  law  that  can  be  used  for 
this  case  comes  from  equations  (36),  (124)  and  (161)  with  the  result  that  for 
an  attractive  force 

-  m  sec  8  sin  B*  t  ^  dv  /d6*  »  -  sin  B  sec  B,-,  x  ^  3W/36  (243) 

vxvx  tt  X  t  XX  WW  X  '  ' 


or  combining  equations  (21)  and  (243)  gives  Newton's  dynamical  law  for  an 
attractive  force  as 
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HI  sec  6^  sin  B*  d/de*(sin  6*  esc  6^,  x/t  d0^/d0*)  (244) 

VXVX  tC  C  tt  XX  X  c 

-  ‘i”  •“  «viu  *■* 


where  for  an  attractive  force  3W/30x  ^  0  .  Another  form  of  Newton's  law  that 
is  suitable  for  this  case  is  obtained  from  equations  (118),  (123),  (124)  and 
(161)  which  give  for  an  attractive  force 


■  “  ®vxvx  ‘^^vx/^®^  ^245) 

-  -  sin  sec  x“^  3W/30^ 

Equation  (245)  can  be  rewritten  using  equation  (126)  to  give  the  following 
form  of  Newton's  dynamical  law  for  attractive  forces 

<=“  6vxw  «xx 

■  ®xx  ®VH.  ®"'®®x 

where  d0^/d0*  is  given  by  equation  (121).  For  an  attractive  force  3W/30][  >  0  . 
The  corresponding  phase  angle  equations  (128),  (163)  and  (168)  give  the  i^ase 
angle  relation 


0  +  B  +  B  -  2(0*  +  B*  )  +  TT 

X  XX  VXVX  t  tt 


®WW  "  ®x  "  ®xx 


(247) 


which  is  valid  for  a  slow  mechanical  process  in  nearly  coherent  space  and  near¬ 
ly  coherent  time. 


The  case  of  a  slow  mechanical  process  that  occuirs  in.  ccdicrent  space  and 
coherent  time  follows  from  equations  (121),  (241),  (242),  (244)  and  (246)  which 
give  Newton's  dynamical  law  as 


mxt"^  sec  d\/d0*^  -  x“^3W/3e^ 

VXVX  X  t  X 

mxt”^  CSC  B®*^^  d6^/d0*(d0^/de*  -  1)  =  x"^3W/30^ 

VXVX  x  t  X  t  X 


(248) 

(249) 


Equations  (248)  and  (249)  can  be  written  as 
mxt"^[(Ef^‘')^  +  (F®^*")^]^^^  =  x"^3W/3e^ 


xt 


where  for  an  attractive  force  3W/30x  ^  0  ,  and  where 
,scc 


tan  B 


VXVX 


„scc /„SCC 
^xt  /^xt 


(250) 


(251) 
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(252) 


-  d0  /de*(d0  /d0*  -  1) 

Xt  X  t'  X  t 

see  _  ^2g  y^gX2 

xt  X  t 


E®^*^  <  0 
xt 


F®^*^  <  0 
xt 


(253) 


It  follows  that 


flSec 

0  -  -  X  +  a  ^ 

vxvx  xt 


(254) 


where  the  siaall  positive  number  is  given  by 


tan  a  -  i  E®*"*"  I  / 1 

xt  '  xt  '  xt 


(255) 


For  this  case  the  phase  angle  equation  (247)  becomes  with  0y  0 


0  -  20*  +  0®*^*^  +  ff/2  -  -  0  -  W./2 

X  t  vxvx  X 


(256) 


Equation  (256)  can  be  rewritten  with  the  help  of  equation  (254)  as 


2(0  -  0*)  +  o  ^ 

X  t  xt 


0*  >  0 
t  X 


(257) 


The  complex  number  form  of  Newton ;s  dynamical  law  that  Is  equivalent  to  equa¬ 
tions  (248)  through  (257)  is  written  as  for  attractive  forces  as 


-  iiK/t^(-  -  -  3W/3x  =  j/x  3W/30^ 


(258) 


where  0g  *  0  for  a  slow  mechanical  process,  and  3W/30x  ^  0  for  an  attractive 
force. 

For  a  repulsive  force  equations  (122)  and  (161)  give  the  form  of  Newton's 
dynamical  law  as 

m  sec  sin  0*^  t  ^  d/d0*(sin  0*^  esc  0^^  x/t  d0^/d0*)  (259) 

“  ~  ^xx  ®WW  3W/30^ 


while  equations  (120)  and  (161)  give  for  a  repulsive  force 

mxt  ^  CSC  0^  sin^0*  esc  0  d0  /d0*  d0  /d0* 
vxvx  tt  XX  x'  t  vx  t 


(260) 


=  -  sin  0  sec  0^-,  x  3W/30 
XX  WW  x 


with  3W/30X  <  0  for  repulsive  forces.  For  a  repulsive  force  equations  (119) 
and  (163)  give 


0+0  +0''’  -  2(0*  +  0*  )  »=  0„  +  0„,  -  0  -  6  -  TT 

X  XX  vxvx  t  tt  W  WW  X  XX 


(261) 
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In  the  limiting  case  of  a  slowly  varying  repulsive  potential  in  coherent  space 
and  coherent  time  equations  (259)  and  (260)  become 


mxt"^  CSC  de^/de*(de^/de*  -  i)  -  -  x"^aw/3e^ 


idiere 


tan 

‘^vxvx  xt  '  xt 


(262A) 

(262B) 

(263) 


g8CC+  ^  gSCC 

Xt  Xt 

„8CC+  _  „SCC 


E*'‘=+  <  0 

Xt 

<  0 

Xt 


(264) 

(265) 


gSCC+  _  see  =  _  ^  + 
vxvx  vxvx  xt 


(266) 


Equation  (262)  for  repulsive  forces  can  be  rewritten  as 

=  -  x'^aw/ae^ 


(267) 


with  3W/30JJ  <  0  ,  while  equation  (261)  becomes  with  By  =  0 


e  -  20*  +  a  , 
X  t  xt 


(268) 


or  equivalently 


2(0  -  0*)  +  a  ^ 

X  t  xt 


0*  >  0 
t  X 


(269) 


For  a  repulsive  force  the  complex  number  form  of  Newton's  dynamical  law  that 
is  equivalent  to  equations  (259)  through  (269)  is  written  as 


inx/t^(-  +  jF®^'")  =  -  3W/3X  =  j/5E  3W/30^ 


(270) 


where  3W/30jj  <  0  for  a  repulsive  force. 

Case  5.  Ultrafast  Process,  Incoherent  Space  and  Incoherent  Time. 

Consider  now  the  form  of  Newton's  law  of  motion  that  can  be  used  to  re¬ 
gain  the  case  of  an  ultrafast  process  in  Incoherent  space  and  Incoherent  time 
that  is  described  by 


^WW 


0=0 

X 


4-0 


(271) 

(272) 
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with  X  and  t  as  variables.  Conblning  equations  (40),  (46),  (47),  (51),  (156), 
(160),  (163)  and  (165)  gives  Newton's  dynamical  law  as 

m  cos  6*^  sec  dv^/dt  -  -  cos  6^^^  esc  B^^  W  30^/ ax  (273) 


6  «e  +B  _0*-e* 
ax  vx  vxvx  t  tt 


(274) 


0  +  B  +  B  -  2(0*  +  B*  ) 
x  xx  vxvx  '  t  tt 


-  0„  +  B™  -  0  -  B 

W  W  X  XX 


with  Bvxvx  ^WW  given  by  equations  (48)  and  (164)  respectively.  Combining 
equations  (18)  a^  (273)  gives  Newton's  dynamical  law  as 

m  cos  B*  sec  B  d/dt(cos  B*^  sec  B_  dx/dt)  (275) 

VaVa  AA 


-  -  CO.  6^  C.C  W  3  Vte 

Combining  equations  (50)  and  (160)  gives  an  alternative  form  of  Newton's  law  of 
motion  as 

”  ®vxv*  ''x  ®xx  =“  6hw  " 

or  equivalently 

0  V 

“  ^vxvx  ®tt  ^xx 

”  "  ^xx  ®WW  ” 

with  W36y/ax  ^  0  for  an  attractive  force,  and  W30y/3x  ^  0  for  a  repulsive  force. 

The  limiting  case  of  motion  in  incoherent  space  and  incoherent  time  is 
obtained  by  combining  equations  (272)  and  (275)  with  the  result  that  Newton's 
dynamical  law  becomes 

m  d^x/dt^  =  -  CSC  B^j^  W  30y/3x  (278) 


Hie  further  limiting  case  of  an  ultrafast  dynamical  system  in  incoherent  space 
and  Incoherent  time  is  derived  by  combining  equations  (271)  and  (278)  which 
gives  Newton's  law  of  motion  as 

m  d^x/dt^  -  -  W30^/3x  (279) 


The  phase  angle  equation  for  Newton's  law  of  motion  in  this  case  is  obtained 
by  combining  equations  (271),  (272)  and  (274)  w:'th  the  result 
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0Uil 

ax 


gUii 

vxvx 


0„  +  ir/Z 


(280) 


i^ere  equations  (279)  and  (280)  are  valid  for  attractive  and  repulsive  forces. 
For  an  attractive  force  W36y/3x  >  0  %ihile  for  a  repulsive  force  Wd6y/dx  <  0  . 

Case  6.  Ultrafast  Process.  Coherent  Space  and  Incoherent  Jime. 

This  section  develops  a  form  of  Newton's  dynamical  law  that  is  suitable 
for  making  the  transition  to  the  case  of  an  tiltrafast  dynamical  system  in  co¬ 
herent  space  and  incoherent  time  which  is  described  by 

^  -  it/2  (281) 

®xx  “  e*  -  0  ®tt  "  ° 

Combining  equations  (53) .  (59)  and  (162)  gives  for  an  attractive  force  the  fol¬ 
lowing  form  of  Newton's  law  of  motion 

-  m  sec  cos  6*^  dv^/dt  «  -  sin  8^  esc  W/x  36y/06jj  (283) 

Combining  equations  (19)  and  (283)  gives  Newton's  law  as 

m  sec  cos  8*  d/dt(cos  8*  esc  8„  x  de  /dt)  (284) 

vXVA  td  tw  XX  X 

“  ®xx 

where  for  an  attractive  force  WdSy/dS^^  >  0  .  The  combination  of  equations  (61) 
and  (162)  gives  an  alternative  form  of  Newton's  law  of  motion  for  an  attractive 
force  as  follows 

•  "  \  ®WM 

Combining  eqxiatlons  (62)  and  (162)  or  coinblnlng  equations  (19)  and  (285)  gives 
Newton's  dynamical  law  for  attractive  forces  as 

m  CSC  8  cos^8*  esc  8  x  d9  /dt  d6  /dt  (286) 

VXVX  Cw  XX  X  vx 


■  ®WW 


where  d6^/dt  is  given  by  equation  (56).  The  phase  angle  equation  for  the 
acceleration  can  be  obtained  from  equations  (63)  and  (163)  as 


“x  +  6xx  +  6„vx  -  +  6«>  -  ”  -  -  ®xx 


(287) 


which  is  valid  for  a  fast  process  in  nearly  coherent  space  and  nearly  incoher¬ 
ent  time. 
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For  the  llnltlng  case  of  an  attractive  ultrafast  mechanical  force  in  co¬ 
herent  space  and  Incoherent  time,  equations  (281),  (282),  (284)  and  (286)  give 
Newton's  dynamical  law  as 

mx  sec  6“^  d^e  /dt^  -  W/x  30  /36  (288) 

V  A  vX  X  W  X 

UK  CSC  6^^  (de^/dt)^  -  W/x  3e^^/3e^  (289) 


Equations  (288)  and  (289)  can  be  rewritten  as  the  following  form  of  Newton's 
law  of  motion 


(pUci)2jl/2 


W/x  36„/30 
W  X 


(290) 


where  for  an  attractive  force  W30y/30x  ^  ®  where 


tan  .. 

VXVX  xt  xt 

(291) 

eUcI  _  cl  ^  ^^^^2 

xt  xt  '  X  ' 

e“?‘> 

xt 

0 

(292) 

.  d^e  /dt^ 

Xt  Xt  X 

F“i  < 

xt 

0 

(293) 

6“*^^  =  6^^  =  ir/2  +  K 

VXVX  VXVX  xt 

(294) 

where  B^vx  is  given  by  equation 

(68)  and  K^t  i® 

given  by  equation  (73) . 

The 

internal  phase  angle  equation  for  Newton's  law  of  motion  for  this  case  follows 
from  equations  (281),  (282),  (287)  and  (294)  as 


0 

X 


+  6 


uci 

vxvx 


Tr/2  =  0,-0 

W  X 


(295) 


or  equivalently 

%  =  2e^  +  (296) 

The  equivalent  complex  number  form  of  Newton's  law  of  motion  for  an  attractive 
force  in  this  special  case  is  given  by 

-  mx(E“^^  -  jF“^^)  =  -  3W/3x  =  -  W/x  30„/30  (297) 

xt'^xt  Wx 

with  W36^j/36x  >  0  for  an  attractive  force. 

For  a  repulsive  force  equations  (53)  and  (162)  give  Newton's  law  of 
motion  as 
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(298) 


n  sec  cos  d/dt(cos  B*^  esc  B^^  x  de^^/dt) 

-  -  sin  B  CSC  B._,  W/x  36„/3e 
XX  WW  W  X 

Alternatively,  combining  equations  (55)  and  (162)  gives  Newton's  dynamical  law 
for  repulsive  forces  as 

“  *vicvx  (299) 

■  -  .in  C.C  W/x  3e„/39^ 


with  y30y/36j(  <  0  .  For  repulsive  forces  equations  (57)  and  (163)  give  the 
phase  angle  condition  as 


0  +  B  +  B  -  2(0*  +  B*  )  -  0„  +  B,„,  -  0  -  B  +  x 

X  XX  vxvx  .  t  tt  W  X  XX 


(300) 


In  the  case  of  an  ultrafast  mechanical  process  that  occurs  in  coherent  space 
and  incoherent  tliu^,  equations  (298)  and  (299)  become  for  repulsive  forces  the 
following  Newtonian  laws  of  motion 


mx  sec  B  d‘0  /dt  -  -  W/x  30„/30 
vxvx  X  W  X 

mx  CSC  (d0  /dt)^  *  -  W/x  30„/30 

vxvx  X  W  X 

where  W30y/36x  ^  0  for  a  repulsive  force,  and  where 

tan 

vxvx  xt  xt 


V 


with 


_uci+  -uci  ,2.  ,,.2 
=d0  /dt 

xt  xt  X 


Then  it  follows  that 

UC1+  ,  uni  .  ,/2  + 

VXVX  VXVX  xt 


>  0 
xt 

<  0 
Xt 


(301) 

(302) 

(303) 

(304) 

(305) 

(306) 


where  given  by  equation  (73).  Then  equations  (301)  and  (302)  can  be 

written  as 


^j(EUCi+)2  ^  ^EUCi+j2jl/2  ^  30^30^ 


(307) 


which  is  valid  for  repulsive  forces  with  W30y/30x  ^  0  .  Equation  (300)  becomes 


0  +  K  ^ 

X  xt 


0„  -  0 

W  X 


(308) 
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Then  the  equivalent  of  equation  (297)  for  repulsive  forces  Is 

-  -  3W/3X  «  -  W/x  36^,/ 36^ 


(309) 


with  wasy/dex  <  0  . 

Case  7.  Ultrafast  Process,  Incoherent  Space  and  Coherent  Time. 

A  formulation  of  Newton's  dynamical  law  Is  presented  that  correctly  re¬ 
duces  to  the  limiting  case  of  an  ultrafast  mechanical  process  in  Incoherent 
space  and  coherent  time  which  Is  described  by 


®WW  “ 


8. 


XX 


6*t  -  ^/2 


(310) 

(311) 


where  x  and  6^  are  variables.  Coiid>lnlng  equations  (95)  and  (160)  gives  Newton's 
law  of  motion  for  attractive  forces  as 

-  m  sec  8  sin  8*  t  ^  dv  /d0*  «  -  cos  8  esc  8,w,  W  3e„/3x  (312) 

vxvx  tt  x  t  XX  WW  W  ' 


Using  equations  (20)  and  (96)  allows  equation  (312)  to  be  written  as 
m  sec  8^^  sin  8*^.  t"^  d/d0*(sin  8*^  sec  8^^^  t”^  dx/dS*) 


(313) 


*=  cos  8  CSC  8,-,  W  30-,/ 3x 
XX  WW  W 

with  W30y/3x  ^  0  .  An  alternative  form  of  Newton's  law^of  motion  for  this 
situation  comes  from  equations  (90),  (91)  and  (160)  with  the  result  that  for 
an  attractive  force 

-  m  CSC  8  sin  6*  v  /t  d6  /d6^  =  -  cos  8  esc  8,„,  W  30,./ 3x  (314) 

vxvx  tt  X  vx  t  XX  WW  W 

or  equivalently  using  equations  (20)  and  (314)  gives  Newton's  dynamical  law  as 

m  CSC  8  sin^8^^  sec  6  t  ^  dx/d©^  d6  /d0*  (315) 

VXVX  tt  XX  t  vx  t 


=  cos  8  CSC  W  30„/3x 
XX  WW  W 

with  W30y/3x  >  0  for  an  attractive  force.  Equations  (312)  through  (315)  are 
completely  general.  The  phase  angle  equation  for  this  general  case  can  be 
written  using  equations  (97)  and  (163)  as 


e  +  8  +8  -  2(0*  +  8*^)  +  IT  =  0„  +  -  0  -  6 

X  XX  vxvx  '  t  tt  W  WW  X  XX 


(316) 


which  can  be  used  to  attain  the  limiting  case  of  Interest. 
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The  limiting  case  of  an  ultrafast  attractive  potential  acting  In  Inco¬ 
herent  space  and  coherent  time  Is  obtained  from  equations  (310).  (311),  (313) 
and  (315)  as  the  following  forms  of  Newton's  dynamical  law 


mt  ^  sec  d^x/de*^  •«  W36,,/3x 

vxvx  t  W 

mt"^  CSC  b“^^  (-  dx/de*)  -  W3e„/3x 

vxvx  t  W 


(317) 

(318) 


Equations  (317)  and  (318)  can  also  be  written  for  an  attractive  force  to  give 
Newton's  law  of  motion  as 


int"^[(E"^'')^  +  «  W3e^j/3x 


(319) 


with  W36y/3x  ^  0  ,  and  where 

tan  6“^'  - 

VXVX  xt  xt 

(320) 

=  -  dx/de* 
xt  t 

(321) 

=  d^x/def 

Xt  t 

>  0 

(322) 

-ulc 

6  =  ir/2  -  Y  ^ 

vxvx  xt 

(323) 

where 

*  fUic /pUic 

tan  Y  a.  =  F  ^  /E  ^ 

'xt  xt  xt 

(324) 

and  Yxt  ^  small  positive  number. 

(316)  Is  written  as 

For  this  case 

the  phase  angle  condition 

_  20*  „  0  ^  ^12 
vxvx  t  W 

(325) 

or  equivalently  as 

«W  =  -  \t  - 

(326) 

The  complex  number  form  of  Newton's  law  of  motion  for  an  attractive  ultrafast 
potential  In  Incoherent  space  and  coherent  time  Is  given  by 

-  m/t^(F“J:‘^  +  =  -  3W/3x  -  -  jW3e  /3x  (327) 

X  C  X  C  W 

with  W30y/3x  ^  0  . 

In  the  presence  of  a  repulsive  force,  equations  (89)  and  (160)  give 
Newton's  equations  of  motion  as 
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in  sec  P"*"  sin  P*  t  ^  d/d0*(sln  P*  sec  P  t  ^  dx/dP*)  (328) 

V3C  L  t  C  C  C  3UC  t 

-  -  cos  6„  CSC  6^  W  3e„/sx 

Combining  equations  (91)  and  (160)  for  repulsive  forces  gives  Newton's  law  as 
m  CSC  p"*"  sin^P*  sec  P  dx/d6*  de  /d0*  (329) 

VXVX  tt  XX  t  vx  t 

«  -  cos  P  CSC  P,_,  W  30-,/ 3x 
XX  VIW  W 


with  W30y/3x  <  0  for  a  repulsive  force.  For  a  repulsive  force  equations  (88) 
and  (163)  give  the  phase  angle  condition  as 


0+8  +  er  -  2(0*  +  P*  )  -  0,,  +  P,_,  - 

X  XX  VXVX  t  tt  W  TiW 


0  - 
X 


XX 


-  IT 


(330) 


For  the  case  of  an  ultrafast  mechanical  process  in  Incoherent  space  and  coher¬ 
ent  time  equations  (311),  (328)  and  (329)  become 


mt"^  sec  d^x/d0*^  =  -  W3e„/3x 

VXVX  t  W 

mt  ^  CSC  (-  dx/d6*)  =  -  W30^/3x 


(331) 

(332) 


where  W30y/3x  <  0  for  repulsive  forces,  and  where 


tan  P 


uic+ 

VXVX 


uic+ 

^xt 


/F. 


uic+ 

xt 


(333) 


pUic+  _ 
\t  - 

=  -  dx/d0* 
xt  t 

uic+ 

^xt 

pUic+  _ 

xt 

uic  ,2  ,,-x2 

F  ^  =  d  x/d6^ 

xt  t 

uic+ 

^xt 

(334) 

(335) 


uic+ 

VXVX 


guic 

'^vxvx 


tt/2  -  Y 


xt 


(336) 


where  Yxt  given  by  equation  (324).  Equations  (331)  and  (332)  can  be  written 
for  repulsive  forces  as 


mt"^[(E"J‘''^)2  +  =  -  W30y3x 


(337) 


while  the  phase  angle  equation  (330)  becomes 


0 


W 


(338) 


The  corresponding  complex  number  form  of  Newton's  law  of  motion  for  a  repulsive 
ultrafast  potential  acting  in  Incoherent  space  and  coherent  time  is 
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(339) 


-  -  3W/3X  -  -  jW3ey/3x 

with  W36y/dx  <  0  and  Ox  “  ^  ^  repulsive  force  In  Incoherent  space. 

Case  8.  Ultrafast  Process,  Coherent  Space  and  Coherent  Tine. 

This  section  considers  a  formulation  of  Newton's  law  of  dynamics  In  bro¬ 
ken  symmetry  space  and  time  that  reduces  to  the  limiting  case  of  an  ultrafast 
mechanical  process  In  totally  coherent  spacetime  which  Is  described  by 

6^  =  ^/2  (340) 

^xx  =  6*,  -  W2  (341) 

where  e^.  and  6*  are  variables.  Equations  (36),  (124)  and  (162)  give  Newton's 
law  of  motion  for  an  attractive  force  as 

-  m  sec  6  sin  8*  t"^  dv^/de*  -  -  esc  sin  6^  W/x  3e„/3e^  (342) 

VXVX  tt  Xt  WW  XX  Wx 

\ 

or  equivalently  combining  equations  (21)  and  (342)  gives  Newton's  law  for  an 
attractive  force  as 

mt"^  sec  sin  6*^  d/de*(sln  8^^  esc  8^  x/t  de^/de*)  (343) 

=  CSC  slm  8^  \t/x  36^j/’30^ 

Another  form  of  Newton's  law  of  motion  for  this  case  is  obtained  from  equations 
(118),  (123),  (124),  (156)  and  (162)  which  gives  the  following  result  for  an 
attractive  force 

-  “  ®vxvx  ®tt  ^x^*' 

=  -  CSC  8^^  sin  8^^  W/x  36^/36^ 

Equation  (344)  can  be  rewritten  using  equations  (21)  or  (126)  as  follows 

mxt  ^  esc  8  sin^8^^  esc  8  d0  /d0*  d6  /d0*  (345) 

vxvx  tt  XX  X  t  vx  t 


=  CSC  6^^  sin  8^^  W/x  30„/3e^ 


where  W30y/80jj  >  0  for  the  case  of  an  attractive  force.  The  corresponding 
phase  angle  equations  (128),  (163)  and  (168)  give 


0+8  +3 

X  vxvx  XX 


-  2(0^  +  8^,)  +  IT  =  e„ 


-  0  -  8. . 


(346) 


which  is  valid  for  nearly  coherent  space  and  nearly  coherent  time. 


The  case  of  an  ultrafaat  attractive  mechanical  potential  in  coherent 
space  and  coherent  time  is  obtained  from  equations  (340)  and  (341),  and  New¬ 
ton's  dynasdcal  law  for  this  special  case  Is  determined  from  equations  (343) 
and  (345)  to  be 

fflict"^  sec  6^  d^e^/de*^  -  W/x  3e„/30^  (347A) 

mxt"^  CSC  de^/d0*(de^/de*  -  1)  -  W/x  30„/30^  (347B) 

vX  vX  X  w  X  L  VV  X 


Newton's  law  of  motion  in  equation  (347)  can  also  be  written  for  this  case  as 


*.“2,  ,_uccv  2  . 
nxt  [(E^^  )  + 


w/x  3e„/30^ 


(348) 


where  W36y/36^  >  0  for  an  attractive  force,  and  where 


tan  8 


ucc 

vxvx 


ucc 

xt 


ucc 

xt 


UCC/  ucc 
xt  '  xt 

(349) 

d0^/d0*(d0^/d0*  -  1) 

E*^^  <  0 
xt 

(350) 

'  d^0  /d0*^ 

X  t 

>  0 
xt 

(351) 

where  and  ^xt  are  given  by  equations  (135)  and  (136).  Therefore 


tan  8 


ucc 

vxvx 


UCC/  ucc 
xt  '  xt 


ucc 

vxvx 


=  8 


c 

vxvx 


-  + «« 


(352) 


where  8^-yx  given  by  equation  (140).  The  phase  angle  condition  for  Newton's 
law  of  motion  for  an  ultrafast  mechanical  process  in  coherent  spacetime  is  ob¬ 
tained  from  equations  (143),  (163)  and  (168)  or  directly  from  equation  (346)  to  be 


*=  0  +  8“^^^  +  Tf/2  -  20* 

ax  X  vxvx  t 


(353) 


=  0 


-  20*  + 


xt 


where  S^t  ^®  given  by  equations  (140)  and  (141).  Equivalently,  equation  (353) 
can  be  rewritten  as 

2(«x  -  ^  'xt  =  ®W  (354) 

Equations  (348)  through  (354)  are  equivalent  to  the  complex  number  form  of 
Newton's  dynamical  law  of  motion  given  by  equation  (156)  which  for  an  ultrafast 
attractive  potential  in  coherent  space  and  coherent  time  is  written  as 

-  mx/t^(-  e"^*"  +  =  -  W/5  3e„/3e^  (355) 
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(356) 


with  W36y/36^  >  0  .  For  a  free  particle  moving  In  coherent  spacetime 
30„/30  -  0 

W  X 

and  two  possible  solutions  to  equation  (355)  can  be  found 


®x  "  *^1 


®x  - +  *^2 


(357) 

(358) 


where  Ci  and  C2  are  constants.  These  solutions  can  also  be  deduced  from  equa¬ 
tion  (348).  Equation  (357)  represents  a  state  of  rest  for  Internal  motion,  and 
equation  (358)  represents  a  state  of  uniform  motion  In  Internal  spacetime. 


For  a  repulsive  force,  equations  (122)  and  (236)  give  Newton's  law  of 
motion  as 


mt  ^  sec  sin  d/d0*(sln  0*  esc  0  x/t  d6  /d0*) 

VXVX  tt  t'  tt  XX  X  t 


(359) 


-  CSC  sin  6^  W/x  3e„/36^ 


An  alternative  expression  for  K:-.wton's  law  of  motion  for  repulsive  forces  is 
obtained  from  equations  (120)  and  (236)  as 


mxt”^  CSC  0^  sin^0*  esc  0^  d6^/de*  d6  ^/d0* 
VXVX  tt  XX  X  t  vx  t 


(360) 


.  -  CSC  6^  sin  6^  W/x  36„/3e^ 

where  W96y/96^  <  0  for  a  repulsive  force.  For  repulsive  forces  the  phase  angle 
equation  for  Newton's  law  of  motion  Is  obtained  from  equations  (119)  and  (163) 
to  be 

0+0  +  0'*’  -  2(0*  +  0*  )  *  8„  +  Buu  -  0  -  -  If  (361) 

X  XX  VXVX  t  tt'  W  WW  X  XX 


In  the  limiting  case  of  an  ultrafast  repulsive  mechanical  potential  In  coherent 
space  and  coherent  time  equations  (359)  and  (360)  become 


raxt"^  sec  d^e  /d0*^  =  -  W/x  a6u/9e^ 

VXVX  X  t  W  X 

raxt"^  CSC  0“*^*^'^  de  /d0*(de^/de*- 1)  =  -  W/x  30u/30^ 

VXVX  X  t  X  t  W  X 


(362) 

(363) 


where 


^  „ucc+  „ucc+ ,_ucc+ 

tan  0  =  E  ^  /F  ^ 

VXVX  xt  xt 


(364) 
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UCC+  .  ucc  .  c 
xt  xt  xt 

UCC+ 

xt 

ucct  ,  ucc  _  c 
xt  xt  xt 

UCC+ 

xt 

<  0 

(365) 

>  0 

(366) 

UCC+  .  ucc  .  _  ,/2  +  5 
vxvx  vxvx  xt 


(367) 


where  and  are  given  by  equations  (350),  (351),  (135)  and 

(136)  respectively,  and  where  is  given  by  equation  (141).  Equations  (362) 
and  (363)  for  repulsive  forces  can  be  rewritten  to  give  Newton's  dynanical  law  as 


-  w/x 


(368) 


or  equivalently 


(369) 


where  Udd^/dd^  <  0  for  repulsive  forces.  The  phase  angle  condition  for  Newton's 
law  of  motion  equation  (361)  becomes  for  this  special  case 

e„  =  2(9^  -  e")  +  6^^  (370) 


The  equivalent  complex  number  form  of  Newton's  law  of  motion  for  repulsive 
forces  is  given  by 


inx/t^(-  =  -  W/x  30„/3e^  (371) 

or  equivalently  as 

inx/t^(-  =  -  W/x  96^/39^  (373) 

m5/t^(-  E^j.  +  jF^j.)  =  -  W/x  90^/39^  (374) 


which  are  valid  for  a  repulsive  force  with  W39y/39x  ^  0  . 

E.  Simple  Harmonic  Oscillator  in  Broken  Symmetry  Spacetime. 

An  elementary  mechanical  system  that  can  be  used  to  describe  the  effects 
of  broken  symmetry  spacetime  on  a  dynamical  system  is  the  simple  harmonic  os¬ 
cillator.  The  complex  number  potential  energy  for  a  simple  harmonic  oscil¬ 

lator  in  broken  symmetry  spacetime  is  given  by 

W  =  l/2kx^  W  =  l/2kx^  9^  =  9j^  +  29^  (374) 

For  the  simple  harmonic  oscillator  potential  it  follows  from  equations  (4), 
(164)  and  (374)  that 
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®ilW  “  ®J( 


(375) 


Because  of  the  validity  of  equation  (375)  It  follows  for  the  slaple  harmonic 
oscillator  that  a  slow  mechanical  process  with  Bww  **  0  must  necessarily  occur 
In  incoherent  space  with  ■>  0  ,  and  an  ultrafast  mechanical  process  with 
Byy  ■>  ir/2  must  occur  In  coherent  space  with  v/2  .  The  simple  harmonic 

oscillator  Is  an  example  of  an  attractive  force  syst». 

The  equations  of  motion  for  the  simple  harmonic  oscillator  are  now  con¬ 
sidered  for  four  limiting  spacetime  conditions. 

Case  a.  Incoherent  Space  and  Incoherent  Time  (Slow  Mechanical  Process). 


Equation  (178)  gives 

d^x/dt^  +  kx  -  0 

which  is  the  standard  equation  for  the  simple  harmonic  oscillator. 


(376) 


Case  b.  Coherent  Space  and  Incoherent  Time  (Ultrafast  Mechanical  Process) . 


Equations  (76),  (290),  (296)  and  (374)  give 
-  ■  0 

-  +  k  -  0 

Xt"^  xt' 


K  .  “  9, 
xt  k 

where  E<^J  and  are  given  by  equations  (69)  and  (70). 

X  w  X  t 


(377) 


(378) 


(379) 


Case  c.  Incoherent  Space  and  Coherent  Time  (Slow  Mechanical  Process). 


Equations  (107),  (217),  (222)  and  (223)  give  with  0*  “  0 

-  m/t^(F^J  +  jE^^)  +  -  0 

-  mr^KE^^)^  +  (F^^)^]^^^  +  kx  =  0 

-  26*  -  0. 
vxvx  t  k 


(380) 


(381) 


(382) 


where  E^^  and  F^^  are  given  by  equations  (103)  and  (104). 

Case  d.  Coherent  Space  and  Coherent  Time  (Ultrafast  Mechanical  Process). 
Equations  (146),  (348),  (353)  and  (374)  give 
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(383) 


-  'xt  *  “ 

-  +  k  -  0  (384) 

xt  xt 

«xt  -  K  ■  \  <“» 

c  c 

where  and  fxt  are  given  by  equations  (135)  and  (136).  It  is  clear  from 
cases  b  and  d  that  the  simple  harmonic  oscillator  can  undergo  ultrafast  inter¬ 
nal  spacetime  motion  in  coherent  space,  and  for  these  two  cases  the  spring 
constant  itself  drives  the  internal  motion. 

4 .  CONCLUSION .  Newton’s  law  of  dynamics  can  be  developed  for  slow  and 
ultrafast  varying  mechanical  potential  functions  and  for  four  possible  varia¬ 
tions  of  the  spacetime  coordinates:  incoherent  space  and  incoherent  time,  co¬ 
herent  space  and  Incoherent  time,  incoherent  space  and  coherent  time,  and  co¬ 
herent  space  and  coherent  time.  This  yields  eight  possible  forms  for  Newton's 
law  of  motion.  The  ultrafast  mechanical  processes  occur  as  rotations  of  the 
complex  number  potential  function  in  an  internal  space  while  the  magnitude  of 
the  complex  number  potential  function  remains  fixed.  The  coherent  changes  of 
space  and  time  coordinates  occur  as  rotations  of  the  complex  number  coordinates 
In  an  internal  space  for  fixed  magnitudes  of  the  coordinates.  Internal  phase 
motions  may  possibly  play  an  important  role  in  the  development  of  nuclear 
rocket  engines. 
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ABSTRACT.  This  paper  develops  the  quantum  mechanical  theory  of  slow  and 
ultrafast  processes  that  occur  in  space  and  time  with  broken  internal  symmetries . 
For  a  slow  quantum  process  the  magnitude  of  the  complex  nuaber  wave  function 
changes  in  space  and  time, while  for  an  ultrafast  process  the  co^;>lex  number  wave 
function  rotates  with  a  constant  magnitude  in  an  internal  space.  For  incoherent 
space  and  time  the  changes  in  coordinates  occtir  as  the  stretch  or  contraction  of 
the  magnitudes  of  the  complex  nuidaer  spacetime  coordinates,  tdiereas  for  coherent 
spacetime  the  changes  of  the  coordinates  occtir  as  rotations  in  an  internal  space. 
The  case  of  a  slow  process  in  incoherent  space  is  Just  the  standard  case  of 
quantum  mechanics.  The  Schrddinger  and  Dirac  equations  are  developed  for  slow 
and  ultrafast  quantm  processes  in  incoherent  and  coherent  spacetime.  For  the 
case  of  an  ultrafast  process  in  coherent  spacetime  Schrddinger 's  equation 
describes  the  internal  phase  angle  of  the  wave  function  in  terms  of  the  internal 
phase  angles  of  the  space  and  time  coordinates.  By  way  of  example  Schrddinger ' s 
equation  for  the  ultrafast  motion  of  a  simple  hai^nic  oscillator  is  developed 
for  coherent  spacetime. 

1 .  IMTRODUCTION .  Ultrafast  processes  have  become  important  diagnostic 
tools  for  the  investigation  of  atomic  and  molecular  phenomena  diet  occur  in  gases 
and  condensed  matter,  and  the  use  of  ultrashort  picosecond  (10*^  sec)  and 
femtosecond  (10'^^  sec)  laser  light  pulses  has  become  the  basic  technique  for 
studying  ultrafast  chemical  and  physical  processes . Physics,  chemistry  and 
biology  have  profited  from  ultrafast  pulse  technology  because  these  light  pulses 
can  be  used  to  examine  atomic  and  molecular  processes  such  as  molecular 
vibrations  in  liquids,  phonon  and  exciton  decay  in  solids,  time  variation  of 
plasma  densities  in  gases  and  solids,  chemical  reactions  at  a  molecular  level  and 
temperature  fluctiiation  processes  among  many  others. In  the  visible  region 
of  ^e  electromagnetic  spectrum  the  Heisenberg  uncertainty  principle  puts  a  limit 
of  a  femtosecond  on  the  duration  of  a  laser  ll^t  pulse,  while  in  the  x  ray 
region  the  limit  of  temporal  resolution  is  an  attosecond  (10~^’  sec)  so  that 
processes  that  occur  faster  by  several  orders  of  magnitude  can  in  principle  be 
observed  as  technology  improves . 

Ultrafast  processes  require  an  appropriate  thermodynamic  description.  A 
relativistic  gauge  theory  of  thermodynamics  has  been  developed  that  determines 
the  renormalized  pressure  and  Grdnelsen  parameter  in  terms  of  the  corresponding 
theoretically  predicted  imrenormalized  values  of  these  two  quantities .  On  the 
basis  of  this  theory  it  was  suggested  that  thermodynamic  quantities  such  as 
internal  energy,  entropy  and  pressure  are  associated  with  broken  internal 
symmetries  and  must  be  represented  as  complex  numbers  in  an  internal  space. 
Space  and  time  coordinates  also  have  broken  internal  symmetries  and  must  likewise 
be  represented  by  complex  numbers  in  an  internal  space. This  is  also  true  of 
kinematical  and  dynamical  quantities  such  as  velocity,  momentum,  acceleration  and 
force. 

A  theory  of  ultrafast  processes  has  been  developed  that  describes  the 
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ultrafasc  variation  of  physical  quantities  as  rotations  in  an  internal  space 
while  the  aagnitudes  of  the  phyeieal  quantities  are  fixedt^^*^^  Xhemodynwiq 
qu^msltiee  euch  Mr  eautcopy,  InterMrl  energy  and  voluae  aH  cdiipieac  MMbers'^hat 
change  as  rotations  in  an  internal  space  for  ultrafast  processes. It  has 
been  suggested  that  thermodynamic  engines  can  be  developed  whose  cycles  occur  as 
changes  in  the  internal  phase  angles  of  the  entropy,  internal  energy  and  volume 
while  the  magnitudes  of  the  quantities  remain  fixed. Quantum  mechanics  can  be 
developed  with  coordinates  that  have  broken  Internal  symmetries  and  the  SchrOdln- 
ger  and  Dirac  equations  have  been  formulated  for  partially  coherent  spacetime. ^ ® ^ ° 
Applications  to  the  problems  of  a  particle  confined  to  a  box  and  the  simple  har¬ 
monic  oscillator  have  been  considered  for  the  external  motion  in  broken  symmetry 
spacetime  and  for  Internal  coordinate  motion  where  the  magnitudes  of  the  coordi¬ 
nates  are  held  fixed.*® 

For  broken  symmetry  spacetime  the  space  and  time  coordinates  are  written 
as  complex  numbers  in  an  internal  space  in  the  following  way 

a  =  a  exp(j0^)  t  =  t  exp(j8^)  (1) 

where  a  =  x  ,  y  ,  z  for  cartesian  coordinates,  a  =  r  ,  i|>  ,  z  for  cylindrical  polar 
coordinates  and  a  =  r  ,  <}i  ,  i);  for  spherical  polar  coordinates.  All  physical  quan¬ 
tities,  with  the  exception  of  the  light  speed  in  the  vacuum,  have  broken  inter¬ 
nal  symmetries  and  must  be  represented  as  complex  numbers  in  an  internal  space. ^ ® 
This  Includes,  for  example,  pressure,  entropy,  energy  and  magnetic  and  electric 
field  strengths.  Therefore  for  the  case  of  quantum  mechanics  the  wave  function 
must  be  represented  as  a  complex  number  in  internal  space  as  follows*^ 

?  =  exp(je^)  (2) 

Strictly  speaking,  the  value  of  the  internal  phase  angle  of  the  time  is  associ¬ 
ated  with  the  particular  physical  quantity  which  is  varying  with  time,  so  that 
for  the  case  at  hand*^ 

t  =  t  exp(je^)  (3) 

4' 

where  0^-  =  internal  phase  angle  of  time  that  is  associated  with  the  time  varia¬ 
tion  of  the  wave  function  Y  .  Space  is  taken  to  be  homogeneous  so  that  the  in¬ 
ternal  phase  angle  of  time  0^  is  independent  of  the  internal  phase  angles  0^  of 
the  space  coordinates.  In  other  cases,  such  as  particle  dynamics  and  kinematics, 
the  space  and  time  coordinates  are  not  independent  and  the  internal  phase  angles 
of  the  space  coordinates  0a  are  each  associated  with  a  corresponding  internal 
phase  angle  of  time  0®  for  a  =  x  ,  y  ,  z  with  80^/90°  ^  0  .  But  for  the  case  of 
the  wave  equations  of  quantum  mechanics  the  time  and  space  coordinates  are  taken 
to  be  independent  parameters  so  that  0|[  and  0^  are  unrelated  quantities  with 
90a/90t  =  0  .  In  general 

4'  =  4'(a,0^,t,0^)  (4) 

0^  =  0^(a,0^,t,0^)  (5) 
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where  a  «  x  ,  y  ,  z  .  Proa  equations  (1)  and  (2)  it  follows  that^**^^ 


®oo  ^aa}^  (6) 

-  CSC  6^^  ode^  exp(j(0^  +  e^^)]  (7) 

dt  -  sec  dt  exp[J(eJ  +  bJ^)1  (8) 

■  CSC  bJ^  tdej  exp[j(e|^  +  bJj.)]  (9) 


d¥  ■  sec 

B^^  d¥  exp[j(0^  +  B^^)l 

(10) 

■  CSC 

6^^  ¥d6^  exp[j(0^  +  B^^,)] 

(11) 

tan 

^aa" 

a30  /3a 
a 

(12) 

tan 

b"  - 

tt 

t30^/3t 

(13) 

tan 

6  ^ 
P¥¥ 

¥36^/3¥ 

(lA) 

¥ 

The  internal  phase  angle  of  the  time  is  associated  with  the  time  variation  of 
the  wave  function  ¥  . 

The  first  derivatives  of  the  wave  function  with  respect  to  space  and  time 


are  written  as 

v^  -  v^  exp(j0^^)  =  3¥/3a  (15) 

u  «  u  exp(j0^)  ■  3¥/3t  (16) 

where  a  “  x  ,  y  ,  z  .  For  the  space  derivatives  in  equation  (15)  the  magnitudes 
Va  can  be  written  as^^ 

v^  =  sec  B^y  cos  6^^  3¥/3a  (17) 

=  CSC  6^^  cos  ¥  30^/3o  (18) 

=  sec  B^^  Sin  B^^^^  a  ^  3¥/30^  (19) 

=  CSC  B^y  sin  Bjjjjj  ¥/a  (20) 


and  the  Internal  phase  angles  as 


0 

va 


+  ^4,4, 


0  -  B 

a  aa 


(21) 
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%ihere  «nd  Byy  are  given  by  equations  (12)  and  (lA)  respectively.  The  nag- 


u  • 

sec 

®y4' 

cos 

- 

CSC 

cos 

- 

sec 

sin 

b’' 

tt 

a 

CSC 

®yy 

sin 

4* 

•^tt 

while  the  internal  phase  angle  is  given  by 

«u  ■  S  *  -  <  -  “It  <“) 

y  4* 

where  6^^  given  by  equation  (13)  and  0^  -  internal  phase  angle  of  time  that 
Is  associated  with  the  time  variation  of  the  wave  amplitude  4*  . 

Four  limiting  forms  of  the  first  derivative  of  the  wave  function  with  re¬ 
spect  to  the  spatial  coordinates  will  now  be  considered. 

Case  1.  Slow  Quantum  Process  and  Incoherent  Space. 

This  is  described  by 


6^4,  =  0 


6=0 

a 


6  =0 
aa 


Then  equations  (17)  and  (21)  give 

v®^  =  a4'/9a  e 


0®^  =  0 
va 


Case  2.  Ultrafast  Quantum  Process  and  Incoherent  Space. 
This  case  is  given  by 


V  ■  =  0 


6  -  0 
aa 


Equations  (18)  and  (21)  become  for  this  case 


=  'l'3e/3a 


+  ^/2 

va  4' 


or  equivalently 


=  j?36^/3a 


Case  3.  Slow  Quantum  Process  and  Coherent  Space, 
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The  following  conditions  hold  for  this  case 

0^-0  -  0  -  Tr/2  (32) 

and  equations  (19)  and  (21)  become 

_  e  -  Tr/2  (33) 

a  a  vo  a  ' 

or  equivalently 

0’‘'/30„  (34) 

Case  4.  Ultrafast  Quantum  Process  and  Coherenc  Space. 

This  case  is  described  by 

“  ^/2  6^^  “  it/2  (35) 

and  equations  (20)  and  (21)  give 

v'^‘^  -  S'/a  ae^/30  6“*^  =  6^-0  (36) 

o  S'  a  va  S'  a 

or  equivalently 

v”*"  =  v/s  ae^/ae^  (37) 

These  are  the  four  limiting  cases  associated  with  the  first  derivative  with  re- 
spect  to  the  spatial  coordinates.^^ 

Now  the  four  limiting  conditions  will  be  given  for  the  first  derivative  of 
the  wave  function  with  respect  to  time.^^ 

Case  1.  Slow  Quantum  Process  and  Incoherent  Time. 

This  case  is  given  by 

0,j,  =  0  =  0  0^  =  0  ° 

and  equations  (22)  and  (26)  become 

u®^  =  ay/at  0®^  =  0  (39) 

u 

Case  2.  Ultrafast  Quantum  Process  and  Incoherent  Time. 

This  case  -is  described  by 

By^  =  TT/2  0^  =  0  ^  ° 
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Equations  (23)  and  (26)  then  give 

-  yae^/at  (4i) 

or 

-  j?9e^/3t  (42) 

Case  3.  Slow  Quantum  Process  in  Coherent  Time. 

The  following  conditions  are  valid  for  this  case 

64,  =  0  6^4,  =  0  6^^  =  tt/2  (43) 

while  equations  (24)  and  (26)  give 

^sc  ^  -  ■"/2 

t  u  t 

or 

u®^  =  -  j/t  34'/30^ 

Case  4.  Ultrafast  Quantum  Process  in  Coherent  Time. 

This  case  is  described  by  the  following  conditions 

64,4,  =  Tt/2  6^^.  “  Tr/2  (46) 

and  equations  (25)  and  (26)  give 

u^*"  =  Y/t  36^/36^  =  S  "  ®t 

which  can  be  rewritten  as 

=  ?/t  30^/30^  (48) 

The  second  derivatives  of  the  wave  function  with  respect  to  the  spatial 
coordinates  will  now  be  represented  in  four  general  forms  which  can  be  special¬ 
ized  to  four  limiting  cases  of  physical  interest  corresponding  to  slow  and  fast 
quantum  processes  in  incoherent  and  coherent  space.  The  second  derivative  of 
the  wave  function  with  respect  to  space  coordinates  is  written  as^^ 

3^?/3a^  =  dv^/da  (49) 

where  a  =  x  ,  y  ,  z  ,  and  where  v^  is  defined  in  equation  (15). 

Case  1.  Slow  Quantum  Process  in  Incoherent  Space. 

A  general  expression  for  the  second  spatial  derivative  of  the  wave  func- 


(44) 

(45) 
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tion  will  be  derived  which  can  be  used  to  pass  to  the  limit  of  a  slow  quantum 
process  in  incoherent  space  which  is  described  by 


S  “  °  0  °  ®aa  “  ° 

for  a  -  X  ,  y  and  z  .  Equations  (1),  (2),  (15),  (17)  and  (49)  give 

^o  “  ®vava  ®ao  <51) 

“  ®vava  ‘'°®  ^oa  3/3“ (sec  0^^  cos  ay/ 3a)  (52) 

while  equations  (21)  and  (49)  give 

0.  =  6  +B  -0-6  (53) 

Ca  va  vava  a  aa  ^  ' 

“  6ui  +  Puiu,  +  6  -  2(0  +  6  )  (54) 

y  yy  vava  a  aa  '  * 

where  is  given  by 

tan  B  =  V  30  /3v  (55) 

vava  a  va  a 

where  Vq,  is  given  by  equation  (17)  and  0^gj  by  equation  (21).  For  this  case  0^j, 
is  a  small  ntnnber.  In  the  limiting  case  of  a  slow  quantum  process  in  incoherent 
space  equation  (50)  is  valid  and  equations  (52)  and  (54)  become 


5®^  a  3^y/3a^ 


which  is  the  conventional  result. 


-  0 
Ca 


Case  2.  Ultrafast  (Quantum  Process  in  Incoherent  Space. 

This  section  derives  a  general  equation  for  the  second  derivatives  of  the 
wave  fuimtloa  with  respect  to  the  space  coordinates  which  can  be  utilize^  to 
obtain  the  limiting  case  of  an  ultrafast  quantum  process  in  Incoherent  space 
which  is  defined  by 


B,,  =  1./2  6^  -  0 


6  -  0 
aa 


where  0y  is  now  a  variable.  From  equations  (18)  and  (49)  it  follows  that 

^  =  CSC  B  cos  B  v  90  /3a  (58) 

a  vava  aa  a  va 

=  CSC  B  cos^B  CSC  Bu-y,  y  96y,/9a  90  „/9a  (59) 

vava  aa  yy  y  va 

where  Bvava  given  by  equation  (55)  with  v^  given  by  equation  (18)  and  where 
equation  (21)  gives 

se^^/aa  -  a/aa(e,  +  s„  -  <60) 
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The  corresponding  Internal  phase  angle  for  the  second  spatial  derivative  is 
given  by 


0.-0  +B  -e-e  (61} 

5o  VO  vova  a  ao  '  ^ 

*  0™  +  e»,y,  +  0  -  2(0  +  B  )  (62} 

*P  Tt  vava  a  oo  ' 

For  an  ultrafast  quantum  process  it  is  convenient  to  Introduce  an  alternative 
representation  of  the  second  derivatives  which  is  given  by 

-  (.1  (63) 

-  3^?/3a^  -  3v  /3o 

a 

where 

<“) 

Then  it  follows  that 

-  -  <==<=  ev„v.  “=  Saa  ''a  <“> 

■  -  ®vava  ««  '' 


\  *  6vava  '  '  '  <‘8) 

for  the  general  case. 

In  the  limiting  case  of  an  ultrafast  quantum  process  in  incoherent  space, 
equation  (57)  is  valid  and  equations  (54)  and  (59)  become 

=  CSC  6*^^  4'(3e„,/3a)^  (69) 

a  vava  n  vy'  / 

+  0“^  +  t^/2  (70) 

Ca  Y  vava 

From  equations  (30)  and  (55)  it  follows  that 

tan  =  eJ;^/f7  (71) 

vava  ¥0  4'a  ' 


where 


>  0 

'Va 


f“"-  <  0 
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Froa  equation  (71)  it  follows  that 


C8C  6 


Equations  (69)  and  (7A)  give 

C  ”  (75) 

For  the  signs  chosen  In  equations  (72)  and  (73)  It  follows  from  equation  (71)  that 

=  tt/2  +  ic^  (76) 

vava  yo  ' 


where  icw„  is  a  small  positive  number  defined  by 


tan 


Equations  (70)  and  (76)  give 

and  equations  (64)  and  (65)  give 

(79) 

■  S  *  So 

SO  that  Is  a  small  number.  Finally,  from  equation  (49)  and  (69)  through 

(80)  It  follows  that 

which  Is  the  equivalent  complex  number  representation  of  equations  (79)  and  (80) . 
Case  3.  Slow  Quantum  Process  In  Coherent  Space. 

An  expression  is  derived  for  the  second  derivative  of  the  wave  function 
with  respect  to  the  spatial  coordinates,  which  can  be  used  to  pass  to  the  limit 
of  a  slow  quantum  process  in  coherent  space  whose  characteristics  are 


=  0 


Byvy  =  0 


B  =  n/2 
aa 


where  6^  is  variable.  From  equations  (49),  (19)  and  (54)  it  follows  that 

C  =  sec  B  sin  B  a  ^  3v  /3e  (83) 

o  vava  aa  a  a 

■  "'■=  ^vava  3/3e„(sec  B„  sin  n'*  3»/3e^)  (84) 
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(85) 


e  «0^a  _  2(6  +6  ) 

So  f  %avo  '“a  act' 

trhere  Syova  given  by  (19),  (21)  and  (55).  In  this  case  It  Is  convenient  to 
Introduce  another  representation  of  the  second  spatial  derivative,  namely 

l,  -  e. 

-  -  9v  /9a 

where 

c;  -  -  (87) 

The  limiting  cas**  of  a  slow  quantum  process  In  coherent  space  Is  obtained 
from  equation  (82)  which  combined  with  equations  (84)  and  (85)  gives 

5®^  =  sec  a~^  3^'{'/30^  (89) 

a  vava  o 

0®*=  =  6®'"  -  20  -  IT  (90) 

5a  vavo  o 


Equations  (33)  and  (55)  give 


^  -SC  _SC/_SC 

tan  8  “  Eu,  /F^ 

vova  'Va  'Va 


where 


34'/3e 


4*0 


F®^  =  d^'v/de^ 

4*0  a 


>  0 
4*0 


sec  6®'"  =  [(E®"")^  +  (F®‘')^]^^^/F®''  (94 

vava  4'a  4'a  4'a 

SC 

and  therefore  ByaYa  1®  ®  small  positive  angle.  Equations  (89),  (92)  and  (94) 
give 

C®^'  =  a"^[(E®‘^)^  +  (F®'")^]^''^  (95 

a  4'a  4'a 


The  alternative  description  of  the  se'ond  derivative  with  respect  to  space  is 
obtained  from  equations  (87),  (88),  (90)  and  (95)  as 

=  -  a-2[(E®^)^  +  (F®^)^]^/2  (96) 

eSc'  _  sc  _  20  (97^ 

5a  vava  a 
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where  Br^  Is  seen  to  be  s  sasll  angle.  The  complex  nuaber  that  Is  equivalent 
to  equations  (49),  (86),  (96)  and  (97)  Is 

C  ■  - +  jC’  <**> 

which  gives  the  second  order  spatial  derivatives  of  the  wave  function  for  a 
slow  quantum  process  in  coherent  space. 

Case  4.  Ultrafast  Quantum  Process  and  Coherent  Space. 

This  section  develops  a  representation  for  the  second  derivative  of  the 
wave  function  with  respect  to  the  space  coordinates  which  can  be  utilized  to 
attain  the  limit  of  an  ultrafast  quantum  process  in  coherent  space  whose  de¬ 
scription  is 


-  Tr/2 


aa 


The  magnitude  and  internal  {diase  angle  of  the  second  derivative  is  obtained 
from  equations  (20) ,  (49)  and  (54)  to  be 


®»ava 

-  8,.v«  ■=“  ’V/’®. 


(100) 


(101) 


+  ^ava  “  2(6„  +  B„„) 


(102) 


where  Byava  given  by  equations  (20)  and  (55),  and  where  from  equation  (31) 
it  follows  that 


(103) 


A  different  expression  for  the  second  derivative  can  be  obtained  from  equations 
(83)  and  (20)  and  is 


C  »  sec  B  sin  B„_  a  3v  /3e 
a  vava  <xa  a  a 


(104) 


sec  B  sin  B  a"^  3/36^  (esc  B^^  sin  B„„  'f/a  (105) 

vava  aa  a  tt  aa  t  a 


A  comparison  of  equations  (101)  and  (105)  gives 


®vava  “ 

®vava  ' 


(106) 


(107) 


(108) 
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where 


C„  -  sin  8„„  esc  t/a  36^/30^  30„^/30^ 
To  oo  TT  T  o  VO  a 


-  3/30^ (CSC  B*^  sin  6  T/o  30^/30^) 
To  O  TT  OO  T  O 


(109) 

(110) 


where  30ya/30Q  is  given  by  equation  (103)  and  where  CipQ  <  0  .  Therefore, 

C  c  c  ^  CSC  B  sin  B  (111) 

o  -  '^Ta  vovo  OO  '  ' 

■  o  ^  sec  B  sin  B  (112) 

To  vovo  oo  ' 


which  can  be  rewritten  as 


(113) 


It  is  convenient  for  the  case  at  hand  to  write  the  second  derivative  of  the  wave 
function  with  respect  to  space  coordinates  in  equation  (49)  as 


«cp(J6j^)  -  {•  exp(jej^) 

-  9^5/90^  -  9v  /9S 
a 


(114) 


where 


a  a 

01  =  Oj,  +  ir 

Co  5a 


(115) 

(116) 


which  gives  a  useful  alternative  description  of  the  second  derivative  with 
respect  to  space. 

For  an  ultrafast  quantum  process  in  coherent  space  described  by  equation 
(99)  it  follows  from  equations  (101),  (105),  (113)  and  (115)  that 


r^c'  „uc  ,  2  „uc 

«vav<. 

■  -  Cva  C 


(117) 

(118) 

(119) 

(120) 


where 


'Ja  *  39/96^96^/96^  -  1) 


-  9^6  /99^ 
Ta  T  a 


<  0 
Ta 


>  0 
Ta 


(121) 

(122) 
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(123) 


(124) 


From  equations  (36)  and  (55)  or  directly  from  equation  (106)  It  follows  that 


tan 

vava 

-  Wa 

(125) 

CSC 

vovo 

-  + 

(126) 

sec 

vovo 

-  [(eJ^)^  + 

(f“)^i‘^^/f- 

(127) 

Because  of  the  choice  of  signs  In  equations  (121)  and  (122)  It  follows  that 

(128) 


-  -  ir/2  +  6^ 
vavo  Va 


where  5^^  >  0  ,  so  that 


tan  -  F“/|E“'| 


(129) 


The  Internal  phase  angle  of  the  second  derivative  then  follows  from  equations 
(99),  (102),  (116)  and  (128)  as 


-  0U,  +  6“^  -  28  -  »/2 

Co  V  vovo  a 

«  -  20  +6™  -  ir 

y  a  to 


®?o  "  +  Cvo  -  26  +  1^/2 

Co  Y  VOVO  o 


0iif  ~  26  + 

f  a  to 


(130) 

C13I) 

(132) 

(133) 


uc 

so  that  6^q  is  a  small  number.  The  complex  number  second  derivative  with  re¬ 
spect  to  space  coordinates  that  corresponds  to  equations  (120)  and  (133)  is 
given  by 

„  ^g2-^g_2juc  ^  _  ja^e^/ae^]  (134) 


»  -  t/o2(-  +  iF“*^) 

'  to  '’to'' 


(135) 


for  an  ultrafast  quantum  process  In  coherent  spacetime. 

In  terms  of  the  elementary  expressions  for  the  space  and  time  derivatives 
of  the  wave  function  it  is  possible  to  formulate  the  quantum  mechanical  theory 
of  ultrafast  processes  in  spacetime  with  broken  internal  symmetries.  Briefly 
the  outline  of  this  paper  is  as  follows:  Section  2  examines  the  time  indepen¬ 
dent  SchrOdinger  equation  for  slow  and  ultrafast  quantum  processes  in  cartesian 
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space  with  broken  Internal  symmetries.  Section  3  investigates  the  time  dependent 
Schrddinger  equation,  and  Section  4  considers  the  slow  and  fast  forms  of  the 
Dirac  equation  in  asymmetric  spacetime. 

2.  TIME  INDEPENDENT  SCHRODINGER  EQUATION  IN  BROKEN  SYMMETRY  CARTESIAN 
COORDINATES .  This  section  develops  the  time  independent  SchrOdlnger  equation 
for  slow  (Incoherent)  and  ultrafast  (coherent)  spatial  variations  of  the  quantum 
wave  function  In  cartesian  space  with  broken  Internal  symmetries.  The  space  co¬ 
ordinates  also  can  vary  incoherently  or  coherently.  The  Incoherent  variation  of 
the  space  coordinates  corresponds  to  changes  In  the  magnitudes  of  the  coordi¬ 
nates,  while  the  coherent  variation  of  the  space  coordinates  corresponds  to  ro¬ 
tations  of  the  complex  number  coordinates  In  an  inteinal  space.  For  a  slow  (In¬ 
coherent)  quantum  process  the  wave  function  Is  a  scalar  in  Internal  space  and  changes 
in  magnitude,  while  for  an  ultrafast  (coherent)  quantum  process  the  complex  number 
wave  function  rotates  in  an  Internal  space.  The  standard  forms  of  the  equations 
of  quantum  mechanics  correspond  to  an  incoherent  process  in  incoherent  space. 
Quantum  mechanics  with  complex  number  wave  functions  In  partially  coherent  space- 
time  has  already  appeared  In  the  literature. Various  forms  of  SchrOdlnger ' s 
equation  have  been  written  for  coherent  spacetime  where  the  change  In  the  complex 
number  coordinates  is  in  the  form  of  a  rotation  in  internal  space. This  section 
develops  Schrbdlnger's  time  independent  equation  for  slow  and  ultrafast  quantum 
processes  in  incoherent  and  coherent  space,  so  that  four  special  cases  are  con¬ 
sidered:  slow  quantum  process  in  incoherent  space,  slow  quantum  process  in  co¬ 
herent  space,  ultrafast  quantum  process  in  incoherent  space,  and  an  ultrafast 
quantum  process  in  coherent  space.  A  slow  quantum  process  is  assumed  to  have  a 
wave  function  that  varies  incoherently  in  space,  while  an  ultrafast  process  is 
assumed  to  have  a  wave  function  that  varies  coherently  in  space. 

A.  SchrOdinger’s  Equation  and  Linear  Momentum  in  Spacetime 
with  Broken  Internal  Symmetries. 

Schrodinger ’ s  equation  for  a  slow  process  in  incoherent  space  is  treated 
fully  in  the  literature. ^ ”or  the  general  case  of  an  asymmetric  wave  func¬ 
tion  in  asymmetric  spacetime  Schrodinger ' s  time  dependent  equation  is  written 
as^^ 


-  ft^/(2p)V^'l'  +  V?  =  i^ia^/3t  (136) 

-2 

where  R  =  h/(2Ti),  h  =  Planck's  constant,  p  =  mass  of  particle,  V  =  Laplacian 
expressed  in  terms  of  complex  number  coordinates,  '1'  =  complex  number  wave  func¬ 
tion  which  is  represented  by  equation  (2) ,  V  =  complex  number  potential  which 
is  written  as^^ 

V  =  V  exp(j0^)  (137) 

and  where  t  =  complex  number  time  as  described  by  equation  (1)  or  more  carefully 
by  equation  (3) . 

If  the  wave  function  is  written  in  a  form  that  is  suitable  for  a  station¬ 
ary  state  as^’ 
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?  -  U(o)  €acp(-  (138) 

then  equation  (136)  reduces  to  the  following  conplex  nuid>er  tine  independent 
Schrodinger  equatiem 

-  /i^/(2u)V^U  +  W  -  EU  (139) 

where  the  conplex  nunher  energy  E  and  wave  function  U  are  written  as 

E  -  E  exp(jeg)  U  -  U  exp(jey)  (lAO) 

Schrodinger ' 8  equation  (139)  is  obtained  from  the  law  of  the  conservation  of 
energy^’ 

tj  p^/(2u)  +  V]U  -  EU  (141) 

a  ® 

combined  with  the  following  momentum  operator  representation  for  complex  num¬ 
ber  cartesian  coordinates^ 

p  *  -  ±hd/da  (142) 

p  0  «  -  ifi3U/35  -  -  ifiw  (143) 

w  ■  w  exp(j6  )  «  3U/3o  v  (144) 

o  a  wa'' 

where  w^  and  6^  are  given  by 

'*»*•**  cos  6^^  3U/3e 

-  CSC  cos  6^  If  3e^/3a  (146) 

-  sec  Bjjjj  sin  6^^  a  ^  311/36^  (147) 

-  CSC  Buu  sin  6^^  U/a  30„/30„  (1^8) 


wa 


®U  ®UU  “  "  ®aa 


(149) 


For  a  slow  process  in  incoherent  space  equations  (17)  and  (143)  give 


p®^U  =  -  ifi3U/3a 


(150) 

(151) 

which  is  the  standard  result.  For  an  arbitrary  variation  of  the  complex  number 


(p®^)^U  =  -  fi^3^U/3o^ 
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wave  function  and  for  coherent  space  coordinate  variations  the  momentum  operator 
equations  (142)  and  (143)  become  ^ 


-  ij/i/o  3/36^  p^U  -  tjh/a  3U/30^  (152) 

Applying  the  momentum  operators  in  equations  (142)  and  (152)  twice  gives  the 
following  results  for  coherent  space  variations*^ 

p2  .  _  .  fi^/a^(3^/30^  -  j3/30  )  (153) 

a  a  a 

p^U  -  -  ft^3^U/3a^  (154) 

-  ft^/a^(3^U/30^  -  j3U/30^)  (155) 


Equation  (155)  will  subsequently  be  specialized  to  the  case  of  an  ultrafast 
quantum  process  where  U  changes  coherently. 


SchrOdinger ' s  equation  (139)  can  be  written  as 


V^U  +  k^U  =  0 

(156) 

where 

ic^  =  2y/fi^(E  -  V) 

(157) 

with 

k  =  k  exp(j0j^) 

(158) 

The  components  of  equation  (157)  are  written  as 

k^  cos(20j^)  =  C 

(159) 

k^  sin(2ej^)  =  D 

(160) 

where 

C  =  2\i/h^(E  cos  Qg  -  V  cos  0y) 

(161) 

D  =  2p/ft^(E  sin  9g  -  V  sin  0^) 

(162) 

so  that  k  and  are  given  by 

tan  20j^  =  D/C 

(163) 

k  »  (C^  + 

(164) 
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(165) 


A  crude  approxination  derived  from  equation  (157)  is 

-x.  -  2u/fi^(E  -  V) 

8 

■  ®E  ■  «V  <>“) 

which  assumes  that  the  internal  phase  angles  of  the  component  terms  in  equation 
(157)  are  equal. 

For  cartesian  coordinates  Schrodinger's  equation  (156)  is  written  as 

-  (3^/3x^  +  3^/3y^  +  3^/3z^)U  =  k^U  (167) 

where  x  ,  y  and  i  are  complex  number  cartesian  coordinates  that  are  represented 
by  equation  (1).  Combining  equations  (49)  and  (167)  gives  SchrOdinger's  equa¬ 
tion  as 

-In-  k^U  (168) 

a  “ 


where 


n  =  n  exp (10  ) 

a  a  noi 


3^U/3o^  -  3w  /35 
a 


(169) 


The  complex  number_second  derivative  n^  is  homologous  to  the  complex  number 
second  derivative  Ca  ^9uatlons  (49)  through  (135).  All  of  the  relations  that 
appear  in  equations  (49)  through  (135)  involving  Ca  have_analogous  ex¬ 

pressions  with  Ha  ^nd  and  correspond  to  the  replacement  ?  U  and  Va  Wq  • 
Equation  (168)  can  be  approximated  by  assuming  that  the  internal  phase  angles 
of  each  term  in  equation  (168)  are  equal.  Because  the  internal  phase  angles  of 
the  second  derivative  n^  can  have  a  zero  or  a  ±  x  added  as  shown  in  Section  1  in 
equations  (54),  (65),  (88)  and  (116),  the  approximate  solution  to  equation  (168) 
requires  that  the  four  possible  states  associated  with  a  slow  or  ultrafast  quan¬ 
tum  process  in  Incoherent  or  coherent  space  be  considered  separately.  SchrOdln- 
ger's  equation  (168)  can  be  written  approximately  as^^ 

-  (±1)1  =  k^u  (170) 

a  “ 

0  ±  (x,0)  =  20,  +  0,,  (171) 

no  k  U 


where  0jj  and  k  are  given  by  equations  (163)  and  (164),  and  where  0^0  is  given  by 
equations  (54)  and  (55)  as 


no 


0„  +  +  3  -  2(6  +  3  ) 

U  UU  wawa  a  aa 


(172) 


where  in  a  fashion  similar  to  equations  (14)  and  (55) 


tan  3^  =  U30y/3U 


tan  3  =  w  36  /3w 

wawa  a  wa  a 


(173) 
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From  equation  (54)  It  follows  that  equation  (171)  can  be  written  as 


6„„  +  S  -  2(0  +  6  )  ±  (ir.O)  «  20. 

UU  wawa  a  aa  k 


(174) 


The  minus  sign  in  the  parenthesis  in  equation  (170)  appears  whenever  the  ±  ir 
appears  in  the  phase  angle  conditions  given  by  equations  (171)  or  (174).  Further 
approximations  to  Schrbdinger 's  equations  (170)  through  (174)  can  be  obtained  by 
writing 


-  (±1)1  =  k^u 

a  “  ® 


(175) 


0  ±  (tt.O)  =  20,  +  e„ 

na  ks  U 


(176) 


+  6  -  2(0  +  6  )  ±  (tt.O)  =  20. 

UU  wawa  a  aa  ^ 


(177) 


where  kg  and  0]jg  are  given  by  equations  (165)  and  (166). 

B.  Four  Possible  Cases  of  Quantum  Processes  in  Space  with 
Broken  Internal  Symmetries. 

The  four  possible  states  associated  with  the  time  independent  Schrbdinger 
equations  (170)  and  (171)  that  correspond  to  slow  and  ultrafast  quantum  pro¬ 
cesses  and  to  incoherent  and  coherent  spatial  coordinate  variations  will  now  be 
considered. 

Case  a.  Slow  Quantum  Processes  in  Incoherent  Space. 

For  this  case  the  momentum  equations  (143),  (145)  and  (149)  become 


p  U  =  -  ifi  sec  6„„  cos  6  3U/3a  exp(j9  ) 

*^a  UU  c  Tt  wa 


(178) 


where  0^^^  is  given  by  equation  (149).  For  this  case  the  cartesian  form  of 
Schrbdinger 's  equation  (167)  combined  with  equations  (51)  through  (54)  gives 

-  y  sec  3  cos  6  9/9a(sec  cos  3  9U/9o)exp(j6  )  =  k^U  (179) 

^  unwn  nn  '  ITTT  nr>  nm 


where  is  given  by  equation  (172).  Equation  (179)  can  be  written  approxi¬ 
mately  by  assuming  that  the  internal  phase  angles  of  the  component  terms  are 
equal,  which  gives  Schrbdinger 's  equation  as 


-  y  sec  3  cos  3  9 /9a  (sec  B„,,  cos  3  9U/9a)  =  k  U 

4-  nn  ill  I  nn 


(180) 


0  =  20,  +  0,, 

no  k  U 


(181) 


where  equation  (181)  is  valid  for  a  =  x  ,  y  and  z  .  Combining  equations  (172) 
and  (181)  gives 
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+  B  -  2(8  +  B  )  -  20, 

UU  wkwa  o  oa  k 


(182) 


In  a  further  approximation  Schrttdlnger ' s  equation  (179)  can  be  written  analo¬ 
gously  to  equations  (180)  and  (181)  using  equations  (165)  and  (166)  as 


■  I  ®wawa  ^UU  ^ao 


e 


no 


20,  +  e„ 

ks  U 


(183) 

(184) 


Equation  (18A)  Is  equivalent  to 

B,„,  +  B  -  2(6  +  B  ) 
UU  wawa  a  aa 


ks 


(185) 


VThen  all  Internal  phase  angles  are  set  equal  to  zero  equations  (178)  and  (179) 
become  the  following  standard  equations  of  quantum  mechanics  for  slow  processes 
In  Incoherent  space 


P®^u  =  -  iRau/aa 


-  I  a'^u/aa" 

a 


k^U 

s 


(186) 

(187) 


where  kg  Is  given  by  equation  (165). 

Case  b.  Slow  Quantum  Processes  In  Coherent  Space. 

In  this  case  the  momentum  equations  (143),  (147)  and  (149)  become 


p  U  =  -  iR  sec  6,„,  sin  B  a 
*^a  UU  aa 


-1 


au/ae  exp(je  ) 

a  wa 


(188) 


which  can  be  used  to  obtain  the  limiting  case  of  a  slow  quantum  process  In  co¬ 
herent  space.  The  general  form  of  SchrUdlnger's  equation  (168)  combined  with 
equations  (84)  and  (85)  gives  for  this  case 


-  y  sec  B  sin  B 
^  wawa 


aa 


a"^  a/ae^(sec  B^j^  sin  B^J^^  9U/9ejj)exp(je^^)  (189) 


=  k^U 


where  Is  given  by  equation  (172).  An  approximate  representation  of  equation 
(189)  is  obtained  by  assuming  that  the  phase  angles  of  the  component  terms  of 
this  equation  are  all  equal  and  using  the  representation  In  equations  (87)  and 
(88)  with  the  result  that  SchrOdinger's  equation  is  written  as 


y  sec  B  sin  B  a  ^  9/36  (sec  B 
^  wawa  aa  a' 


UU  »■' 


6'  =  20, 
na  k 


0 


(190) 

(191) 
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Equation  (191)  can  be  rewritten  with  the  help  of  equations  (85)  and  (88)  as 


6„„  +  8  -  2(6  +  6  )  +  ir  -  20. 

UU  wawa  a  aa  k 


(192) 


vdtere  and  k  are  given  by  equations  (163)  and  (164).  A  further  approximate 
representation  of  Schrbdlnger's  equation  (189)  is  obtained  by  using  equations 
(159)  and  (160)  as 


y  sec  B  sin  B  3/30  (sec  B„„  sin  B  a"^  3U/30  )  «  k^U  (193) 


a  S 


B„„  +  B  -  2(6  +6  )  +  TT  =  26.  =  e„  =  6,, 

UU  wcwa  a  aa  ks  E  V 


(194) 


Note  the  plus  sign  on  the  left  hand  side  of  equations  (190)  and  (193)  which 
indicates  that  the  kinetic  energy  is  positive.  For  this  case  Buu  '''  0 
Baa  TT  /  2  . 

For  a  slow  quantum  process  in  coherent  spacetime  equation  (82)  is  valid 
and  the  momentum  equation  (188)  becomes 


p  =  ijfi/a  3U/3e 
a  a 


(195) 


which  agrees  with  equations  (34)  and  (143).  In  the  case  of  a  slow  quantum 
process  in  coherent  spacetime  equation  (82)  is  valid  and  Schrbdinger's  equation 
(189)  becomes 


-  y  sec  B®^  a"^  3^U/30^  expCjS®*^)  =  ic^U 

5  wawa  a  qa 


(196) 


where  from  equation  (90)  with  9tt  =  0 


0®^  =  6®^^  -20-7: 

qa  wawa  a 


(197) 


The  small  positive  angle  is  given  by 

.  „sc  sc  jqSC,,  sc  „sc /„SC 

tan  B  =  w  d6  /dw  =  E„  /F,, 
wawa  a  wa  a  Ua  Ua 

sec  B®""  =  (F®^)"^(E®^)^  + 

wawa  Ua  Ua  Ua 


(198) 


(199) 


where 


E®*^  =  -  3U/36 
Ua  a 

F®/^  =  3^U/30^ 
Ua  a 


E®'"  >  0 
Ua 


Ua 


(200) 


(201) 


Using  equations  (198)  through  (201)  allows  Schrbdinger's  equation  (196)  to  be 
written  as 
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(202) 


8C^  _  ,-2. 


Using  equations  (98)  and  (168)  shows  that  equation  (202)  is  equivalent  to  the 
following  complex  number  form  of  Schrddinger ' s  equation 


I  *  <>  ■ 


(203) 


where  in  an  analogous  manner  to  equation  (98)  and  (169) 
-sc  ,  ,_2,-sc  j-sc. 


(204) 


and  equation  (203)  therefore  follows  from  equations  (168)  and  (204). 


The  alternative  description  of  the  second  spatial  derivative  given  by 
equations  (87),  (88),  (96)  and  (97)  allows  SchrSdinger's  equation  (202)  to  be 
written  as 


r  “2,, -SC.  2  .  ,_sc.2,l/2  /-qSc  .. 

I  “  t(E,.  )  +  (Fn„)  ]  exp(3e^^  )  =  k  U 


scV  ,-2, 


(205) 


where 


+  IT 

rja  no 


(206) 


6®*^  -  20 
wuwa  a 


By  assuming  that  the  Internal  phase  angles  of  all  of  the  component  terms  of 
equation  (203)  are  equal,  this  equation  can  be  written  as  two  approximate  scalar 
forms  of  Schrbdinger’s  equation 


6  -  26  =  20, 
wawa  a  k 


(207) 


(208) 


Equation  (208)  is  the  appropriate  limiting  form  of  equation  (192).  A  further 
approximate  representation  of  Schrodinger's  equation  (205)  is  obtained  from 
equations  (165)  and  (166)  as 


(209) 


0  =  26.  =  6_  =  6„ 
T\a  ks  E  V 


(210) 


for  a  =  X  ,  y  and  z  .  Combining  equations  (206)  and  (210)  gives 


3'*'-  -  20  =  26,  =  0-  = 
wawa  a  ks  E  V 


(211) 


which  agrees  with  equation  (194)  f  r  the  case  of  a  slow  quantum  process  in  co¬ 
herent  space.  Equation  (209)  can  e  obtained  directly  from  equation  (193)  for 
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this  special  case.  Equations  (207)  through  (211)  give  approximate  scalar  forms 
of  Schrodlnter's  equation  for  a  slow  process  In  coherent  space. 


For  the  one  dimensional  case  SchrOdlnger's  equations  (203),  (205)  and  (206) 
can  be  written  for  a  slow  process  In  coherent  space  as 


(212) 

1/x^KE^)^  +  -  k^U 

(213) 

eSc'  ^  SC  _  29 

nx  wxwx  X 

(214) 

where 


tan  3 


sc 

wxwx 


E®^/F 


sc 

Ux 


(215) 


SchrOdlnger's  equation  (213)  can  be  written  exactly  as 


(216) 


3®*^  -  26  =  20, 

wxwx  X  k 


(217) 


where  6j^  and  k  are  given  by  equations  (163)  and  (164).  Approximate  forms  of 
SchrOdinger ' s  equations  (216)  and  (217)  are  written  as 


Ux  ^  s 


(218) 


3  -  29  =  29, 

wxwx  X  ks 


(219) 


where  kg  and  0j5^g  are  given  by  equations  (165)  and  (166)  and  where  x  =  constant. 
Equations  (216)  through  (219)  are  the  scalar  forms  of  SchrOdinger ' s  equation  in 
one  dimension  for  the  case  of  a  slow  quantum  process  in  coherent  space. 


Case  c.  Ultrafast  Quantum  Processes  in  Incoherent  Space. 


The  momentum  equation  for  this  case  is  obtained  from  equations  (143), 
(146)  and  (149)  to  be 

p  U  =  -  ift  CSC  6,„,  cos  3  U  90,,/9a  exp(j9  )  (220) 

UU  aa  U  wa 

or 

Pa  =  -  i^  CSC  3yy  cos  3^^  exp(j0;^)  (221) 


where 


0'  =0 
wa  wa 


(222) 


*  6uo 


e  -  6 

a  aa 


5<8 


%ihere  U  has  been  divided  out  of  equation  (221).  The  proper  f orm  of  SchrOdlnger '  s 
equation  (167)  that  is  suitable  for  passing  to  the  limiting  case  of  an  ultra¬ 
fast  quantum  process  in  incoherent  spacetime  can  be  obtained  from  equations 
(A9),  (59)  and  (62)  as 

■  I  ®UU  «P(J  ^^23) 

where  the  complex  number  wave  function  U  has  been  divided  out  of  equation  (223) 
and  where  equation  (60)  gives 


30  /3a  »  3/3o(e„  +  -  0  -  6  ) 

wa  '  U  UU  a  ao 


and  \diere  Ajja  is  obtained  from  equation  (62)  as 

A„  -  0  -  0„ 

Ua  not  U 


+  6  -  2(0  +  B  ) 

UU  wawa  a  aa 


(224) 


(225) 

/ 

I 


SchrOdlnger ' s  equation  (223)  can  be  represented  by  two  approximate  scalar  equa¬ 
tions  by  using  the  representation  in  equations  (64)  and  (65)  and  assuming  the 
equality  of  the  internal  phase  angles  of  each  term  in  equation  (223)  with  the 
result  that  SchrOdlnger's  equation  becomes 


2 

y  CSC  0  __  cos  B  CSC  B,ti,  30„/3a  30  /3a 

5  wawa  aa  UU  U  wa 


B.„,  +  e  -  2(0  +  B  )  -  TT 

UU  wawa  a  aa 


20, 


(226) 

(227) 


where  0|^  and  k  are  given  by  equations  (163)  and  (164).  A  further  approximation 
gives  SchrUdinger ' s  equation  as 


y  CSC  B  cos  B  CSC  B.nt  30„/3a  30  /3a 

5  wawa  aa  UU  U  wa 

6 


+  3  -  2(0  +  0  )  -  TT 

UU  wawa  a  aa 


28,  =  0„ 

ks  V 


(228) 

(229) 


where  kg  and  ©ks  are  given  by  equations  (165)  and  (166). 

For  an  ultrafast  quantum  process  in  incoherent  space  equation  (57)  is 
valid  and  the  momentum  equation  (221)  becomes 


p^^  =  -  ijft3ey/3a 


(230) 


where  for  this  case  0uu  =  Tr/2  ,  0a  =  0  »  Bran  =  0  and  0j 


,  Baa  “  0  Bwa  “  '^1'^  *  where  U 
has  been  divided  out  of  equation  (220),  and  which  agrees  with  equations  (31) 
and  (143).  In  the  case  of  an  ultrafast  process  in  incoherent  spacetime  the 
combination  of  equations  (57),  (223)  and  (225)  give  SchrUdinger ’ s  equation  as 


-  y  CSC  b“^  (30  /3a)^  exp(jA^^)  =  k^ 

5  wawa  U  Ua 


(231) 
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-  9., 

Ua  na  U 


(232) 


■  if/2  +  6  “ 

wawa  Ua 


(233) 


vhere  Bwocwo  obtained  from  equation  (71)  as 


(234) 


(235) 


>  0 
Uo 

<  0 
Ua 


(236) 


(237) 


Then  equation  (233)  gives 

K  =  tan  ^(|fJ!^|/eJ!^) 
Ua  Ua'  Ua 


=  -  Tr/2 

wawa 


and  from  equation  (78)  It  follows  that 


(238) 


6  =  e  +  K  +  IT 

na  U  Ua 


(239) 


while  equation  (81)  gives 


n  =  -  U(E„  -  jF„  ) 
a  Ua  Ua' 


(240) 


which  is  a  special  case  of  equation  (169). 

Combining  equations  (231),  (235)  and  (236)  gives  the  SchrUdinger  equation 


r  ,,_ui.2  .  ,„ui.2,l/2  /.-.ui.  ,-2 

-  I  [(E„„)  +  (Fj,„)  J  e*p(3X„^)  -  k 


(241) 


or  equivalently  using  equations  (232)  and  (233) 


(242) 


Using  equations  (168)  and  (240)  gives  SchrUdlnger ' s  equation  for  this  case 
also  as 


I  <  - 


(243) 
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SchrOdlnger's  equation  (242)  can  be  %nrltten  in  an  approxlntate  form  by  assuming 
that  each  of  the  phase  angles  in  the  sum  on  the  left  hand  side  of  equation  (242) 
are  equal,  with  the  result  that 


,ui,2  ^  ,„ui,2,l/2  _  .2 


(244) 


•'Ua  -  201^ 


(245) 


where  is  given  by  equation  (238) .  A  further  approximate  form  of  SchrOdin- 
ger's  equation  for  this  case  is  obtained  from  equations  (165)  and  (166)  as 


(246) 


<11  ■  26,  -  0„  -  0„ 
Ua  ks  V  E 


(247) 


where  Is  a  small  angle.  Equations  (244)  and  (245)  agree  with  equations 
(226)  and  (227),  and  equations  (246)  and  (247)  agree  with  equations  (228)  and 
(229)  in  the  limiting  case  of  an  ultrafast  process  in  coherent  space. 


For  the  one  dimensional  case  SchrOdinger ' s  equations  (242)  and  (243)  for 
an  ultrafast  process  in  Incoherent  space  are  written  as 


f,_ui.2  .  ._uiv2.1/2  ..  V  r2 


(248) 


^Ix 


(249) 


Schrbdlnger's  equations  (248)  or  (249)  can  also  be  written  as 
[(eJ^)2  +  (fJ^)2]1/2  .  1,2 


(250) 


<11..  ■  20, 

Ux  k 


(251) 


where 


'ux  -  l<l< 


(252) 


and  6y,  and  k  are  given  by  equations  (163)  and  (164).  The  approximate  forms  of 
Schrbdinger 's  equations  (250)  and  (251)  are  written  as 


[(e"^)^  +  (fJ{^)^]^^^  =  k^ 

Ux  Ux''  ^  s 


(253) 


''ux  “  ^®ks 


(254) 


where  kg  and  0^8  are  given  by  equations  (165)  and  (166) 


Case  d.  Ultrafast  Quantum  Processes  in  Coherent  Spacetime. 
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The  quantum  mechanical  expression  for  the  momentum  of  a  particle  that  is 
suitable  for  this  case  is  obtained  fr<xn  equations  (1A3),  (148)  and  (149)  as 

p^U  -  -ih  CSC  sin  6^^  U/a  36^/38^ 

or  equivalently 

-  -  Ifi  CSC  6^  sin  36„/3e^  (256) 

where 


8'  =0  -  e„  -  6,„,  -  6  -  e 

wa  wa  U  UU  aa  a 


(257) 


and  where  0  has  been  divided  out  of  equation  (255).  The  form  of  SchrOdlnger ' s 
equation  (167)  that  is  required  for  the  passage  to  the  limit  of  an  ultrafast 
quantum  process  in  coherent  spacetime  can  be  obtained  from  equations  (101), 

(102)  and  (225)  as 

-  y  CSC  e  sln^B  CSC  B,„,  a"^  3e„/3e  39  /3e  exp(jX„  )  =  (258) 

5  wawo  aa  UU  U  a  wa  a  Ua 

where  the  complex  number  wave  function  U  has  been  divided  out  of  equation  (258), 
^Ua  given  by  equation  (225),  and  where  equation  (149)  gives 


36  /36  =  3e„/39  -  1  +  3/36  (6,„,  -  6  ) 

wa  a  U  a  a'  UU  aa' 


(259) 


Schrbdlnger's  equation  (258)  can  be  written  as  two  approximate  scalar  equations 
by  using  equations  (115)  and  (116)  and  by  assuming  that  the  phase  angles  of  each 
term  in  the  sum  of  equation  (258)  are  equal  with  the  result  that  SchrOdinger 's 
equation  becomes 


y  CSC  B  sin^B  esc  B„,,  a  ^  30„/36  30  /36  = 

5  wawa  aa  UU  U  a  wa  a 


(260) 


B,„,  +  B  -  2(0  +’B  )  +  TT  =  28, 

UU  wawa  a  aa'  k 


(261) 


A  further  approximation  for  Schr8dinger*s  equation  follows  by  using  equations 
(165)  and  (166)  which  gives 


2  —2  2 
y  CSC  B  sin  B  esc  B,n,  a  30„/30  36  /36  =  k 

5  wawa  aa  UU  U  a  wa  a  s 


(262) 


B,n,  +  B  -  2(0  +  B  )  +  TT  =  20, 

UU  wawa  a  aa  ks 


(263) 


The  kinetic  energy  term  of  SchrOdinger 's  equation  in  (260)  and  (262)  appears 
with  a  positive  sign  as  should  be  the  case. 


For  the  case  of  an  ultrafast  process  in  coherent  space  equation  (99)  is 
valid  and  the  momentum  equation  (256)  becomes 
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(264) 


30^/30^  exp(-j0^) 

-  -  iR/o  30„/30 
U  01 


Equation  (264)  also  follows  directly  from  equation  (152)  for  the  special  case 
of  an  ultrafast  process.  For  the  case  of  an  ultrafast  quantum  process  in  co¬ 
herent  spacetime,  equations  (99),  (258)  and  (259)  gives  SchrSdlnger's  equation  as 


-  I  '‘"=  3e„/3e^(36„/se^  -  i)«pOx“)  -  £ 


,-2 


(265) 


where  equations  (99),  (128),  (130),  (133)  and  (225)  give 
,uc 


Uo 


0“*^  -  e 

no 


u 


0“*^  -  0  -  IT 

no  u 

_  20  -  ii/2 

wawa  a 


6„  -  20  -  ir 

Ua  a 


(266) 

(267) 

(268) 
(269) 


where  0j|^  and  0JJ^  are  given  by  equations  (130)  through  (133).  Equations  (125) 
through  (129)  give 

,uc  „uc  ,„uc 


tan  6. 

CSC  8 


wawa 

uc 

wawa 


Ua 


(270) 

(271) 


6 

wawa 
tan  6 


-  +  «Ua 

Oa  ■ 


(272) 

(273) 


where  from  equations  (121)  and  (122) 
^uc 


Ua 


36^/36^(36^36^  -  1) 


-  3^6/36^ 
Ua  U  a 


EtT  ^  0 

Ua 


Ua 


(274) 

(275) 


where  is  a  small  number.  It  is  also  useful  to  define  the  following  quantity 
,uc' 


Ua 


=  0 
=  6 


uc  . 

Tt  +  ^ 

Uo 

(276) 

1“"='  -  6 
na  u 

(277) 

i„  -  20 

Ua  a 

(278) 
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where  Is  a  small  angle. 

Combining  equations  (121),  (126)  and  (265)  gives  SchrOdinger's  equation  as 
-  I  a’^[(ES^)^  +  (279) 

Combining  equations  (276)  and  (279)  give  Schrbdinger's  equation  as 


r  -2,,„uCx2  .  ,_uc.2,l/2  /.,uc'v  .-2 

I  a  KEj,^)  +  (F„^)  1  «cp(jXo^  )  -  k 


(280) 


For  the  case  of  an  ultrafast  process  in  coherent  space  it  is  easy  to  show  that 
equations  (153)  through  (155)  give 


-2  rl.Jl.  pUC 

p  =  n  la  (-  E,,  +  jF,,  ) 

*^a  Ua  Ua 


so  that  Schrbdlnger ' s  equation  (280)  is  equivalent  to 


I  ^  .  JF-,  -  k^ 


(281) 


(282) 


which  is  a  result  that  can  also  be  obtained  from  equation  (168)  by  noting  that 
in  an  analogous  fashion  to  equation  (135) 


-uc  ri/-2/  pUC  .  .„uc. 
■'a  '  -  <-  ^ua 


Schradinger ' s  equation  (282)  can  be  rewritten  as 


I  a"^exp(-  2je^)[3ey/3e^(l  -  96^/36^)  +  j3^ey/3e^]  = 


(283) 


(284) 


An  approximate  set  of  scalar  SchrOdinger  equations  that  are  equivalent  to  equa¬ 
tions  (280)  and  (282)  are  written  as 


I  -  k^ 


=  20 

Ua  k 


Combining  equations  (278)  and  (286)  gives 


6„  -  20  =  20, 

Ua  a  k 


(285) 


(286) 


(287) 


Equations  (285)  and  (287)  agree  with  the  limiting  forms  of  Schrbdinger 's  equa¬ 
tions  (260)  and  (261)  for  the  case  of  an  ultrafast  process  in  coherent  space. 

A  further  approximate  form  of  Schrbdinger 's  equation  follows  from  equations 
(165),  (166),  (285)  and  (286)  as 


I  .  kf 


6„  -  20  *  20.  =  0,,  =  6_ 

Ua  a  ks  V  E 


(288) 


(289) 
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For  one  space  dimension  SchrOdinger's  equation  for  an  ultrafast  process 
in  coherent  space  is  given  by  any  of  the  following  three  forms 

i/x^[d0y/de^(i  -  dOy/de^)  +  jd^Oy/de^]  -  (290) 

x‘^exp(-  2j0^)tdey/d0^(l  -  d0y/d0^)  +  jd^0y/d0^]  -  (291) 

mh-  ^  +  JFj^)  -  k^  (292) 


where  x  *  constant.  Schrddinger 's  equation  (292)  can  also  be  written  as 


(293) 

-  20 

=  20, 

(294) 

Ux  X 

k 

where 

tan  6„  ■ 

Ux 

C295> 

The  approximate 

form  of  SchrOdlnger's  equations  (293)  and 

(294)  are  written  as 

[(eS^)2  + 

._uc,,2,l/2  ,2  2 

(296) 

-  20 

Ux  X 

(297) 

In  general  k  >  k(x>0^)  while  kg  *  kg(x)  «  constant. 

The  kinetic  energy  term  for  the  case  of  an  ultrafast  quantum  process  in 
coherent  space  can  be  obtained  as  a  special  case  of  the  more  general  situation 
of  an  arbitrary  qxiantum  process  in  coherent  space  which  is  represented  by  equa¬ 
tion  (155)  of  this  paper  or  equation  (250)  of  Reference  26,  so  that  Schfbdinger 's 
equation  for  an  arbitrary  process  in  coherent  space  is  given  by 

I 

I  l/a^O^U/30^  -  j3U/90  )  =  k^O  (298) 

a  a  a 

Schrddinger ' s  equation  (298)  can  be  specialized  to  the  case  of  an  ultrafast 
process  in  coherent  space  by  noting  that  for  this  case 

30/30^  =  jU3ey/3e^  (299) 

3^0/30^  =  U[-  (36^/36^)^  +  j3^ey/3e^]  (300) 

Combining  equations  (298)  through  (300)  immediately  gives  the  Schrbdinger 's 
equation  (282) . 

C.  Examples  of  Complex  Number  Potential  Functions. 

This  section  considers  several  complex  number  potential  functions  which 
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are  of  Interest  to  the  case  of  ultrafast  processes  In  coherent  space.  A  simple 
solution  to  Schrhdlnger's  equation  (290)  or  (291)  can  be  obtained  for  the  spe¬ 


cial  simplifying  condition 
2-2  22  2 

xk  “xk  «a  ■  real  constant  (301) 

vhich  may  be  approximately  true  over  a  limited  range  of  space.  Equation  (301) 
gives  the  constant  wave  number  as 

k  -  a/x  (302) 

where  x  >  constant  for  coherent  space.  Equation  (157)  shows  that  the  conditions 
in  equations  (301)  and  (302)  are  exactly  valid  for  a  choice  of  the  complex  num¬ 
ber  potential  as 

V  -  E  -  [a^fi^/(2u)]/x^  (303) 

Then  Schrbdlnger ' s  equation  (290)  is  written  as 

de^/ds^d  -  d0y/d6^)  +  jd^ejj/de^  =  a^  (304) 

which  gives 

de^/dS^d  -  d0y/d0^)  =  -  ^  (305) 

d^0y/d02  =  =  0  (306) 

Equation  (306)  suggests  a  solution  of  the  form 

0y  =  +  g  (307) 

where  f  and  g  are  constants.  Combining  equations  (305)  and  (307)  gives 

f(l  -  f)  =  (308) 

whose  solution  is 

f  =  1/2  ±  1/2(1  -  (309) 


=  1/2  ±  1/2(1  -  4k 
where  x  =  constant 

One  example  of  the  complex  number  potential  for  the  case  of  an  internal 
phase  harmonic  oscillator  in  one  space  dimension  is  given  by 

V  =  l/2£i^  K  =  K  exp(j0j^)  (310) 

»  l/2Kx^  exp[j(0j^  +  20^)] 
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tfhere  K  -  constant  and  x  -  constant  so  that  6*  is  the  dynaadcal  variable.  The 
scalar  equations  corresponding  to  equation  (310)  are 


V 


l/2Kx^ 


constant 


(311) 


26 


(312) 


The  complex  nuaiber  wave  nun^er  E  for  the  simple  harmonic  oscillator  is  obtained 
from  equations  (157)  and  (310)  to  be 

-  2u/h^(E  -  l/2Kc^)  (313) 


while  equations  (159)  through  (164)  determine  k(e^)  and  e]t(6x)  .  SchrOdinger's 
equation  (290)  becomes  for  this  case 

i/jc^tde^/de^d  -  dSy/de^)  +  jd^e^/de^l  -  Zu/h^d  -  i/2iS^)  (3i4) 


whose  solution  Is  not  simply  obtained. 

Another  possible  type  of  simple  harmonic  oscillator  potential  that  may 
describe  Internal  space  motions  Is  given  by 

Vi  -  l/2K^e^  =  exp(je^^)  (315) 

where 


K.  =  K.  exp(je„.  ) 
lx  lx  Klx-' 

so  that 


(316) 


V.  =  1/2K.  0 
1  lx  X 


6  =6 
Vi  Klx 


(317) 


Then  equation  (157)  gives  the  wave  number  as 
k^  -  2w/fl^(E^  -  l/2K^e^) 


(318) 


from  which  k(0x)  and  0k(®x)  calculated  by  equations  (159)  through  (164). 

The  Schradinger  equation  (290)  for  this  case  is  written  as 


i/x^td0y/de^(i  -  dSy/de^)  +  jd^e^j/de^]  =  Zu/h^d^  -  i/2K^e^) 


2,= 


_  2 
■\v»  r\ 


(319) 


Equation  (319)  is  equivalent  to  equation  (372)  of  Reference  27  for  the  special 
case  of  a  coherent  wave  function  that  is  associated  with  an  ultrafast  quantum 
process. 


3.  TIME  DEPENDENT  SCHRODINGER  EQUATION  IN  BROKEN  SYMMETRY  CARTESIAN 
SPACETIME.  This  section  considers  the  time  dependent  Schradinger  equation  for 
slow  and  ultrafast  quantum  processes  in  spacetime  with  broken  internal  symme¬ 
tries.  Only  cartesian  coordinates  are  considered.  The  complex  number  time 
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dependent  SchrOdlnser  equation  Is  written  as  the  following  generalization  of  the 
standard  equation^^'^^ 

-  +  Vf  -  i^3?/3t  (320) 

where  the  wave  function  and  potential  are  %n:itten  as 
?  -  V  exp(j0^) 

V  -  V  exp(j0^) 

For  cartesian  coordinates  equation  (320)  becomes 


-  3^V/3a^  +  V?  =  ife3?/3t  (323) 

a 

where  ?  ■  ?(5,t)  .  Combining  equations  (16),  (49)  and  (323)  gives 

-  h^/i2\i)l  I  +  VV  -  iJiu  (324) 

a  ® 

where  u  and  Cq  ere  given  by  equations  (16)  and  (49).  Equation  (324)  can  be 
rewritten  as 

-  h^/(2v)l  exp(j0^^)  +  Vy  exp(j(ey  +  0^)]  ihu  exp(j0^)  (325) 

where  and  u  are  evaluated  in  Section  1  for  various  spacetime  cases.  In  one 
dimension  equation  (325)  becomes 

-  R^/(2m)K^  exp(je^^)  +  vy  exp[j(ey  +  0^)]  =  ihu  exp(j0^)  (326) 


where  u  and  roust  be  evaluated  according  to  the  relevant  spacetime  conditions 
as  in  Section  1 . 

The  exact  solution  of  equation  (325)  requires  that  the  real  and  imaginary 
parts  of  each  component  term  be  taken,  but  this  leads  to  very  complicated  equa¬ 
tions.  A  simpler  but  approximate  procedure  of  solving  equation  (325)  assumes 
that  the  internal  phase  angles  of  each  term  in  equation  (325)  are  equal.  Then 
because  the  internal  phase  angles  of  the  second  derivative  can  have  a  (0,±ir) 

term  included  as  described  in  Section  1  in  equations  (54),  (65),  (88)  and  (116), 

and  the  internal  phase  angles  of  the  first  derivative  with  respect  to  time  can 

have  a  term  (0,±Tr/2)  added  as  described  by  equations  (39),  (41),  (44)  and  (47), 

the  solution  of  equation  (325)  requires  that  eight  possible  cases  be  considered 
individually:  slow  and  fast  processes,  coherent  and  incoherent  space,  and  co¬ 
herent  and  incoherent  time.^^  Therefore  equation  (325)  can  be  written  analo¬ 
gously  to  equations  (170)  and  (171)  as  the  following  approximations  that  are 
obtained  by  assuming  that  the  internal  phase  angles  of  each  component  term  are 
equal ^  ^ 


(321) 

(322) 
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(327) 


-  (±l)fi^/(2y)^  £  +  Vy  -  iftu 

a  ® 

®5a  “  ®V  ®y  "  ®u  (0.±ir/2)  (328) 

From  equations  (26)  and  (5A)  It  follows  that  equation  (328)  can  be  written  in 
analogy  to  equation  (174)  as  the  following  approximation 

Svava  -  +  <“•-’>  '  \  (329) 

A  comparison  of  equations  (174)  and  (329)  shows  that  the  following  approxima¬ 
tions  are  valid  for  a  stationary  state 

+  <“■-’'2)  -  2\s  ■  \  -  ®E  »30) 

and 

The  time  dependent  SchrSdinger  equation  given  by  equations  (327)  through  (331) 
will  now  be  delineated  for  the  eight  possible  states  that  are  associated  with  a 
slow  or  ultrafast  quantum  process,  coherent  or  incoherent  spatial  coordinate 
variation,  and  coherent  or  incoherent  time  variation.^’  Reference  27  on  wave 
propagation  establishes  the  basis  of  development  for  this  section. 

Case  a.  Slow  Quantum  Process,  Incoherent  Space  and  Incoherent  Time. 

This  section  develops  a  general  form  of  the  time  dependent  Schrbdinger 
equation  that  can  be  used  to  obtain  the  linnfttiiiig:  case  of  a  slow  quantum  process 
that  occurs  in  incobereat  space  and  incoherent  time  which  is  (tescrlbed!  by 

6,-0  6„  -  0  6^-0  -  0  6*  >  O'  -  0^  (3i2) 


For  the  case  when  all  Internal  phase  angles  are  small,  equations  (327)  and  (328) 
are  written  as 

-  k^l{2\y)l  K  +  Vy  =  ±hu  (333) 

a  “ 


~  0,1  +  0,,,  =  0 
V  y  u 


(334) 


For  this  case  equations  (22),  (52)  and  (333)  give 

-  R^/(2vi)y  sec  3  cos  3  9/9a(sec  3,,,,„  cos  3  9y/9a)  (335) 

5  vava  aa  yy  aa 

+  vy  =  ifi  sec  3^,j,  cos  3^^.  9y/9t 


where  is  given  by  equations  (17),  (21)  and  (55).  Combining  equations 

(329)  and  (334)  gives 
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+  6  -  2(0  +  e  ) 

W  vava  o  ao' 


y  y 

6yy  -  ”  ®tt 


(336) 


For  the  case  of  a  slow  quantum  process  in  incoherent  space  and  incoherent  time 
equation  (332)  is  valid  and  equation  (335)  becomes  the  standard  time  depend«it 
Schrddinger  equation 

-  3^y/3o^  +  vy  -  ift3y/3t  (337) 

a 

while  equation  (336)  gives  a  null  result.  A  stationary  state  follows  from  the 
choice 


y  =  U(a)  exp(-iEt/fi) 


(338) 


which  combined  with  equation  (337)  yields  the  stationary  state  Schrbdinger 
equation  (187). 

Case  b.  Slow  Quantum  Process,  Coherent  Space  and  Incoherent  Time. 

A  general  form  of  the  time  dependent  Schrbdinger  equation  is  developed 
that  can  be  used  to  deduce  the  limiting  form  of  this  equation  for  a  slow  quan¬ 
tum  process  in  coherent  space  and  incoherent  time  which  is  characterized  by 

S  =  0  ■  0  «c.c  -  -  0  ‘  ° 

For  the  general  case  equations  (22)  and  (84)  can  be  combined  with  equation  (325) 
to  give  the  following  time  dependent  Schrbdinger  equation 

-  n  /  {2m)^  sec  8  sin  8  a 
g  vava  aa 

+  vy  =  ifi  sec  Bvjivj,  cos  9y/9t 
An  approximate  representation  of 


9/9e^(sec  6^^  sin  6^^  a"^  9y/9e^)exp(je^^) 


exp(je  ) 


(340) 


equations  (323)  or  (340)  is  given  by 


^'^/(2u)[  5  +  vy  =  i^u 


0.  +  TT  =  0„  +  0^  =  0 

Ca  V  y  u 


or  equivalently 


S^/(2vj)y  sec  8  sin  8  a  ^  9/90  (sec  sin  8  ct  ^  9y/90  ) 
a  vava  aa  a  yy  aa  a 

+  V'i’  =  ih  sec  B(j,^  cos  8^^^  9y/9t 


(341) 

(342) 


(343) 


^yy  ®vava  "  ^^®a  ®aa^  ^  ®V  “  ^yy  ~  ®t  ^tt 


(344) 


which  allows  the  kinetic  energy  term  to  appear  as  a  positive  quantity. 


The  time  dependent  Schrodinger  equation  which  is  the  limiting  form  of 
equation  (340)  for  the  case  of  a  slow  quantum  process  in  coherent  space  and  in¬ 
coherent  time  follows  from  equation  (98),  (136)  and  (339)  as 

h^/(2v)^  +  JE®^)  +  Vf  -  ifiay/at  (345) 

For  this  special  case  of  a  slow  quantum  process  in  coherent  space  and  Incoherent 
time  equations  (339),  (343)  and  (344)  give  the  following  approximate  time  depen¬ 
dent  SchrOdlnger  equations 

ifiat/at  (346) 

(347) 

I 

by  equations  (91)  through  (94). 

=  (348) 

and  (347)  also  follow  directly  from 
wave  function  has  the  form 

4*  =  U  exp(-iEt/fi)  (349) 

and  when  placed  in  the  time  dependent  Schrbdinger  equation  (345)  yields  the 
stationary  state  equation  (203)  while  equation  (348)  becomes  equation  (209) . 

Case  c.  Slow  Quantum  Process,  Incoherent  Space  and  Coherent  Time. 

This  section  derives  the  proper  form  of  tdie  time-  dependent  Schrbdinger 
equation  for  passing  to  Che  limit  of  a  slow  quantum  process  occurring  in  inco- 
bDerent;  space  usd  cohecenC  time  whiedl  is  described  by 

6=0  8  =0  0=0  8  =  0'  8**^=  Tr/2  (350) 

4'  4'4'  o  aa  tt 

Equations  (24)  and  (52)  are  combined  with  equation  (325)  to  give  the  following 
Schrodinger  equation 

-  h^/(2u)l  sec  6^^^^  cos  8^^  3/aa(sec  8^^  cos  8^^  a4'/3a)exp(je^^)  (351) 

+  Vf  =  ifi  sec  8^^  sin  8^^.  t  ^  34'/ae^  exp(je^) 

An  approximate  representation  of  the  Schrddinger  equation  (351)  is  given  by 

-  h^/(,2u)l  5  +  V4'  =  i^iu  (352) 

a  “ 

0,  =  e„  +  0„  =  0  +  ti/2  (353) 

5a  V  4*  u 


fr/i2\i)l  sec  8®^^  a  3^y/30^  +  Vy  = 
^  vava 


a 


-  20  =  0, ,  =  0_  =  0 
va-/a  a  V  E 


sc 

where  is  a  small  positive  angle  given 

Equation  (346)  can  also  be  written  as 

h^/(2v)l 


The  approximate  Schrddinger  equations  (348) 
equation  (345)  .  For  a  stationary  state  the 
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or  equivalently  using  equations  (24),  (26),  (52)  and  (54) 

-  R^/(2u)J[  sec  cos  3/3o(sec  cos  dHl/ia) 

+  VH-  -  iR  sec  8^^  sin  8^^  t“^  av/ae^ 


■*’  ®vava  ~  ®oo^  “  ®V  “  ^*4'  "  ®t  "  ®tt  ■‘' 


(354) 


(355) 


where  the  right  hand  sides  of  equations  (352)  and  (354)  are  real  numbers,  and 
where  6y  has  been  subtracted  from  both  sides  of  equation  (355).  The  ir/2  term 
in  equations  (353)  and  (355)  is  required  to  pass  to  the  limit  of  a  slow  quantum 
process  in  incoherent  space  and  coherent  time. 


For  the  special  case  of  a  slow  quantum  process  in  incoherent  space  and 
coherent  time  equations  (350),  (354)  and  (355)  give  Schrbdlnger ' s  equation  as 


-  h^/(2M)l  a^y/ao^  +  vv  -  iRt“^a4'/ae'f 


(356) 


The  phase  angle  equation  (355)  becomes  in  the  limit 
0  "v-  0  '\-  6  =  -  0'^ 

V  I!i  t 


(357) 


Consider  now  the  wave  function  for  the  stationary  state  for  the  slow  process  in 
incoherent  space  and  coherent  time  which  is  taken  to  have  the  following  form 


4*  =  U  exp(-iEt0^/R) 


(358) 


where  t  =  constant.  Combining  equations  (356)  and  (358)  gives  the  Schrodinger 
equation  for  the  stationary  state  as 


-  9^U/9a^  +  VU  =  EU 

a 


(359) 


which  is  the  standard  time  independent  Schrbdinger  equation. 

Case  d.  Slow  Quantum  Process,  Coherent  Space  and  Coherent  Time. 

This  section  obtains  the  required  form  of  the  time  dependent  Schrbdinger 
equation  that  is  needed  to  obtain  the  limiting  condition  of  a  slow  quantum  pro- 
cess  in  coherent  space  and  coherent  time  which  is  specified  by 


64,  =  0 


=  0 


8  =  ir/2 

aa 


B  =  ir/2 
tt 


(360) 


Equations  (24),  (84)  and  (325)  gives  Schrbdinger's  equation  as 
-  R^/(2M)y  sec  8  sin  8  a  ^  9/90  (sec  sin  8  a  ^  94'/90  )exp(j0_  ) 

“  vava  nn  n'  W  nn  n  r  Fr, 


aa 


4'4' 


aa 


+  VV  «  iR  sec  8^y  sin  8^^  t"^  a4'/a0^  exp(j0^) 


(361) 
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An  approximate  representation  of  SchrOdinger's  equation  (361)  is  given  by 

fi^/(2w)|  +  Vf  »  iftu  (362) 


+  Tt  -  e„  +  -  6  +  Till 

V  y  u 

or  equivalently,  SchrOdinger 's  equation  can  be  written  approximately  as 

Til  t  ^  \  n  n  _ •  ^  _1 


(363) 


h}/il\i)l  sec  6  sin  6  a"^  3/36  (sec  sin  B  a"^  3'}'/3e  )  (364) 

g  vava  aa  a  aa  a  ' 


+  V'F  =  id  sec  6^^  sin  6^^  t"^  3y/3e^ 


*  Svcva  '  ®aa>  ’  S,  '  <  '  »65) 

where  641  has  been  subtracted  from  both  sides  of  equation  (365).  The  right  hand 
sides  of  equations  (362)  and  (364)  must  be  real  numbers.  The  ir  and  n/l  terms 
in  equation  (365)  are  necessary  to  pass  to  the  limiting  case  of  a  slow  quantum 
process  In  coherent  space  and  coherent  time. 


The  limiting  form  of  equation  (361)  for  the  case  of  a  slow  quantum  pro¬ 
cess  in  coherent  space  and  coherent  time  is  the  following  exact  comple*  number 
Schrddinger  equation 


R^/du)^  1/5^(F®^  +  jE®^)  +  Vy  «  ih/l  3¥/3ftJ 


(366) 


A  combination  of  equations  (360),  (364)  and  (365)  for  the  special  case  of  a  slow 
quantum  process  in  coherent  space  and  coherent  time  gives  the  approximate  SchrO- 
dlnger  equation  as 


fi^/(lv)l  sec  B®*^  a  ^  3^4'/3e^  +  VY  =  /del 

k  vava  a  t 


6®*^  -  26  =  e,  =  -  el”  =  e„ 

vava  a  V  t  E 


(367) 


(368) 


Combining  equations  (92)  through  (94)  with  equation  (367)  gives  the  following 
approximate  Schrbdinger  equation 


h‘^niM)l  a’2[(E®^)^  +  (F®^)^]^/2  +  74-  =  iRt'^34'/36^ 


-1  y 


(369) 


Equation  (369)  also  follows  directly  froi  equation  (366).  For  the  choice  of  the 
wave  function  for  coherent  time  of  the  form 


y  =  U  exp(-iEt8^/S) 


(370) 


=  U  exp(-lEt8^/^) 
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equation  (366)  becomes  the  stationary  state  time  independent  Schrddinger  equa¬ 
tion  given  in  equation  (203) ,  while  the  choice  of  the  wave  function  in  equation 
(370)  brings  equation  (369)  into  the  form  of  the  time  independent  Schrddinger 
equation  (209) . 

Case  e.  Ultrafast  Quantum  Process,  Incoherent  Space  and  Incoherent  Time. 

This  section  considers  the  general  form  of  the  time  dependent  Schrbdinger 
equation  that  is  suitable  to  pass  to  the  limiting  case  of  an  ultrafast  quantum 
process  in  incoherent  space  and  incoherent  time  which  is  described  by 

°  °  ®t  “  °  ®tt  “  ° 

Equations  (23) ,  (59)  and  (325)  give 

-  ft^/(2y)|  CSC  cos^e^^  esc  V  30^/3o  3e^^/3a  exp(j0^^)  (372) 

+  Vy  »  ifi  CSC  6^^  cos  6^^  y  3e^/3t  exp(j0^) 

Schrodinger ' s  equation  (372)  can  be  represented  approximately  by  the  following 
two  equations 

h^/i2\i)l  e  +  Vy  =  ±hu  (373) 

a  “ 

*  «V  S  ‘ 

where  the  term  exp(j04<)  has  been  factored  out  of  equation  (373).  Equations  (373) 
and  (374)  can  be  rewritten  as  the  following  approximate  equations 

R^/(2y)|  CSC  ^yy 

=  iR  CSC  B^^  cos  b'[^.  30^/3t 


Svava  -  *  "v  =  '  “I  '  ‘It  ' 


(376) 


The  -  TT  and  -  •it/2  terms  that  appear  in  equations  (374)  and  (376)  are  required 
to  pass  to  the  limiting  case  of  an  ultrafast  quantum  process  in  incoherent  space 
and  incoherent  time . 

In  the  special  case  of  an  ultrafast  process  in  incoherent  space  and  inco¬ 
herent  time  equations  (81)  and  (372)  give  the  following  exact  complex  number 
form  of  Schrodinger ’s  equation 


h^/i2v)l  (Ey^  -  jfIJ^)  +  V  =  ijfi30y/3t 


(377) 


where  y  has  been  divided  out  of  equation  (372)  to  obtain  equation  (377).  For 


this  special  case  equations  (371),  (375)  and  (376)  give  an  approxismte  form  of 
Schrddinger's  equation  as 


h^/(2u)l  CSC  (36^/30)^  +  V  -  iR38^/3t 

®vava  -  "  So  “  ®V  “  ®E  "  0 


(378) 


(379) 


Equations  (74)  and  (378)  give  the  following  approximate  Schrhdlnger  equation 


ft^/(2w)I  [(E^S^  +  +  V  -  ifi3e  /3t 


(380) 


Equation  (380)  also  foUows  directly  from  equation  (377).  Equations  (375), 
(377),  (378)  and  (380)  require  that  the  stationary  state  for  Incoherent  time 
is  given  by 


6^  -  00  +  6^^ 


-  -iEt/fi 


(381) 


which  makes  the  right  hand  sides  of  equations  (378)  and  (380)  real  numbers,  and 
is  the  stationary  state  solution  which  when  placed  in  equations  (377)  and  (380) 
gives  the  time  independent  equations  (243)  and  (246)  respectively. 

Case  f.  Ultrafast  Quant*»-i  Process,  Coherent  Space  and  Incoherent  Time. 


This  section  examines  the  form  of  the  time  dependent  SchrSdlnger  equa¬ 
tion  that  can  be  used  to  pass  to  the  limit  of  an  ultrafaat  quantum  process  in 
ccdieient  space  and  Incoherent  time  idiose  characteristics  are 


8™™  “  tr/2  a  =  ir/2  -  0  sL  -  0 

XT  aa  t  tt 


(382) 


Equations  (23),  (101)  and  (325)  give  the  relevant  time  dependent  SchrOdinger 
equation  as 


-  h  /(2v)l  CSC  sin  esc  6^^  y/a^  ^®va''^®a  exp(j0^„) 


+  vy  «  i^  CSC  8^^  cos  y  30^/31  exp(j0^) 


(383) 


Equation  (383)  can  be  separated  into  the  following  two  approximate  Schrbdinger 
equations  by  using  equations  (41),  (115)  and  (116) 


h^/(2M)l  5  +  vy  =  ifiu 


(384) 


0.  +  TT  =  0„  +  0^  =  0  -  11/2 

V  y  u 


(385) 


which  are  equivalent  to  the  following  approximate  SchrOdinger  equations 

h^/(,2u)l  CSC  B  sin^B  Cb.  a~^  36  /30  +  V 

k  vava  aa  yy  y  a  va  a 


(386) 


=  ih  CSC  B^^  COS  B^j.  36^j,/3t 
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+  6  -  2(6  +6  )  +  ^  «  e,,  -  6^^  -  e'f  -  -  n/2  (387) 

'('¥  vava  a  aa  V  tr  t  tt 

where  a  factor  ?  has  been  divided  out  of  equation  (386).  The  right  hand  sides 
of  equations  (384)  and  (386)  must  be  real  numbers  in  internal  space. 

An  exact  complex  number  Schrddinger  equation  that  is  deduced  from  equation 
(383)  in  the  limiting  case  of  an  ultrafast  quantum  process  in  coherent  space 
and  incoherent  time  is  written  as 

k^l{2v)l  eJq  +  jPja)  +  V  =  ij^9e^/3t  (388) 


where  a  factor  4*  has  been  factored  out  to  obtain  equation  (388).  For  this  lim¬ 
iting  case  equations  (382),  (386)  and  (387)  give  the  following  approximate  Sch- 
rBdinger  equations 

A^/(2ii)T  CSC  a~^  30u,/30  (30u,/30  -  D  +  V  =  iRse^/at  (389) 

“  vava  r  a  r  a  4* 

b'^^  -  20  +  Tr/2  =  0„  =  0^  =  0  (390) 

vava  a  V  E 


Equations  (389)  and  (390)  can  be  rewritten  using  equations  (126)  and  (128)  re¬ 
sulting  in  the  approximate  Schrbdinger  equations  given  by 


h^/(2u)l  +  V  =  ifi36^/3t 


(391) 


6 


4*0 


26  =  6. 


(392) 


where  Eij/^  and  are  given  by  equations  (121)  and  (122),  and  where  the  right 
hand  sides  of  equations  (389)  and  (391)  must  be  real  numbers  in  internal  space. 
The  approximate  Schrodinger  equations  (391)  and  (392)  can  also  be  obtained  di¬ 
rectly  from  equation  (388).  Schrodinger' s  equation  (391)  has  a  stationary 
state  solution  for  incoherent  time  which  is  of  the  form 

%  =  ®u  ®Yt 


and  equations  (388)  and  (391)  become  the  time  independent  Schrodinger  equations 
(282)  and  (288)  respectively. 

Case  g.  Ultrafast  Quantum  Process,  Incoherent  Space  and  Coherent  Time. 

This  section  considers  the  time  dependent  Schrodinger  equation  that  can 
be  used  to  obtain  the  limiting  form  of  the  equation  for  the  case  of  an  ultra¬ 
fast  quantum  process  in  incoherent  space  and  coherent  time  which  is  described  by 

6„  -  W2  9^-0  b’j  =  n/2  (394) 

Equations  (25),  (59)  and  (325)  give  SchrBdinger's  equation  as 
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-  CSC  exp(je^^)  (395) 

+  V?  -  ifi  CSC  B^^  sin  bJ^.  H'/t  36^/30^  exp(je^) 

Equation  (395)  is  represented  by  the  following  approximate  scal^  SchrSdinger 
equations  by  using  equations  (16),  (64)  and  (65) 


rr/i2u)l  £  +  vy  -  ±hu 


+  0U,  “  ® 

ia  V  y  u 


(396) 

(397) 


which  are  written  equivalently  as  the  following  approximate  Schrodinger  equa¬ 
tions  by  using  equations  (25),  (26),  (59)  and  (68) 


fi^/(2w)y  CSC  B  cos^B  esc  ^  30^/3o  30  /3a  +  V 

a  vava  aa  fV  y  va 

=  ih  CSC  B^^  sin  B^^.  t~^  30,y/3e|J 


(398) 


®yy  ^vava  ~  ®aa^  "  ~  ®V  "  ®yy  ~  ®t  "  ^tt 


(399) 


where  a  factor  y  has  been  divided  out  of  equation  (398)  and  the  angle  By  has 
been  substracted  from  equation  (397>  to  obtain  equation  (399). 

For  the  case  of  an  ultrafast  quantum  process  in  inedterent  space  and  co¬ 
herent  time  the  limiting  form  of  equation  (395)  can  be  obtained  from  eqxiations 
(48),  (81)  and  (324)  to  be  the  following  exact  SchrOdinger  equation 


h^K2M)l  (e"^  -  jF“^)  +  V  =  ifi/t  30^/30^ 


(400) 


where, a  factor  y  has  been  divided  out  of  equation  (400).  Combining  equations 
(394),  (398)  and  (399)  gives  the  following  approximate  Schrodinger  equations  for 
the  special  case  of  an  ultrafast  quantum  process  that  occurs  in  incoherent  space 
and  coherent  time 


(i^/(2p)I  CSC  (96^/30)^  +  V  .  ilit  '3e,,/3e’ 


6“‘ 

vava 


tt/2  = 


0 


y 

t 


(401) 

(402) 


Using  equations  (74)  and  (76)  allows  equations  (401)  and  (402)  to  be  written  as 
the  following  approximate  Schrodinger  equations 


S^/(2W)I  laff  +  +  V  .  liit-9e,/96; 

y 


,ui.2,l/2 


-1. 


“^ya  "  “  ®E  ®t 


(403) 

(404) 
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The  approxioate  SchrOdinger  equations  in  equations  (403)  and  (404)  can  be  ob¬ 
tained  directly  from  the  exact  equation  (400).  SchrOdinger 's  tiro  dependent 
equations  (400)*  (401)  and  (403)  for  coherent  time  have  a  stationary  state  so¬ 
lution  of  the  form 

"  ®u  (^05) 

-  -iEte^/fi  -  -iEteJ/fi  (406) 

where  the  following  reality  condition  has  been  used 

Et  -  Et  (407) 

which  is  valid  for  wave  propagation  with  constant  E  It  should  be  added  that 
for  wave  propagation  with  constant  wave  number  and  frequency  u  the  following 
conditions  are  also  valid^^ 

k^a  ■  k^a  5t  “  wt  (408) 

where  a  ®  x  ,  y  ,  z  .  Substituting  equations  (405)  and  (406)  into  (400)  and  (403) 
yields  the  time  independent  stationary  state  equations  (243)  and  (246)  respec¬ 
tively. 


Case  h.  Ultrafast  Quantum  Process,  Coherent  Space  and  Coherent  Time. 

Finally,  this  section  develops  the  form  of  the  time  dependent  Schrbdinger 
equation  that  can  be  used  to  pass  to  the  limit  of  an  ultrafast  quantum  process 
in  coherent  space  and  coherent  time  which  is  characterized  by 

=  Tr/2  6^^  =  Tr/2  6^^.  =  Tt/2  (409) 


Equations  (35),  (101)  and  (325)  give  the  proper  form  of  Schrbdinger ' s  equation 

2,.^.r  _  .2_  _  ...  I  2 


as 


-  fi^/(2y)y  CSC  S  sin^B  esc  H'/a^  86u,/96  96  /96  exp(j0  ) 

a  a  va  a  Ca 


+  Vy  =  iR  CSC  6  Sin  T/t  96  /9e]|  exp(j6  ) 


(410) 


As  before,  equation  (410)  can  be  separated  into  two  approximate  scalar  Schrbdin- 
ger  equations 


fi^/(2u)l  +  VH*  =  ifiu 

a  “ 


6  +  IT  =  6  +  6  *  6 

Co  V  u 


(411) 

(412) 


which  can  be  written  equivalently  as  the  following  approximate  SchrOdinger  equa¬ 
tions 
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(413) 


Jt^/(2w)y  CSC  e  CSC  B**  o"^  30„/3e  36  730  +  V 

g  vava  ao  tt  T  o  va  a 

-  ih  CSC  6^^  sin  0^^  t“^  30^/36^ 


®T»  *  *va*a  -  ^<®a  +  *<..>  +  *  *  %  *  8„  -  sj  -  (414) 

where  Oq  and  6^  are  variables. 

The  llmiclng  form  of  equation  (410)  for  Che  special  case  of  an  ulcrafast 
quantum  process  in  coherent  space  and  coherent  time  can  be  obtained  using  equa¬ 
tions  (48),  (135)  and  (324)  and  yields  an  exact  Schrbdlnger  equation  for  this 
case 

h^/i2u)l  1/5^ (-  +  jp"^)  +  V  -  ±h/l  3e^/3e|J  (415) 


where  '¥  has  been  divided  out  of  equation  (410)  to  obtain  equation  (415).  In  the 
special  case  of  an  ultrafast  quantum  process  In  coherent  space  and  coherent 
time,  equations  (409),  (413)  and  (414)  give  the  approximate  SchrBdlnger  equa¬ 
tions  as 


fi^/(2y)j;  CSC  30^/36^(36^/30^  -  1)  +  V  =  36^/30’^ 


e  -  26  +  ir/2  «  e„ 
vciva  a  V 


(416) 

(417) 


These  two  equations  can  be  rewritten  using  equations  (126)  and  (128)  as  the 
following  approximate  SchrOdlnger  equations 


fi^/(2v)l  +  V  -  ifit"^3e^/3e^ 


(418) 


(419) 


The  total  Internal  phase  angle  of  the  wave  function  Is  written  as  In  equations 
(405)  and  (406).  The  approximate  equations  (418)  and  (419)  can  also  be  obtained 
directly  from  the  exact  Schrbdinger  equation  (415).  Equations  (415)  and  (418) 
have  stationary  state  solutions  of  the  form  given  by  equations  (405)  and  (406) 
which  reduce  equations  (415)  and  (418)  to  their  time  independent  forms  given 
by  equations  (282)  and  (288)  respectively. 

4.  DIRAC  EQUATION  IN  BROKEN  SYMMETRY  SPACETIME.  This  section  investi¬ 
gates  slow  and  ultrafast  relativistic  quantum  processes  in  coherent  and  inco¬ 
herent  spacetime.  The  broken  symmetry  form  of  the  Dirac  equation  is  written  as 
the  following  complex  number  generalization  of  the  standard  Dirac  equation^® 

-  iRy  dV/dx  +  mf  =  0  (420) 

y  y 

where  =  Dirac  matrices,  ?  =  complex  number  four  component  Dirac  spinor  which 
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l8  now  a  set  of  four  complex  numbers  in  an  internal  space,  and  where  the  complex 
number  space  and  time  coordinates  are  designated  by 


X  -  X  exp(16  ) 
y  y  xy 


(421) 


where  y  «  0, 1,2,3  and  Xq  «  ct,  xj  ■  x  ,  X2  “  y  and  X3  ■  z  .  The  Dirac  equation 
(420)  can  be  rewritten  as  follows 


-  ifiy  +  m?  -  0 

y 


where 


exp(j0  )  -  3?/3x 
vy  y 


where 


-  sec  cos  B  3'i’/3x 

yy  xyxy  y 

-  CSC  Bu-,u  cos  B  y  30u,/3x 

yy  xyxy  y  y 


«  sec  B^u,  sin  B  x  3y/30 

yy  xyxy  y  xy 


CSC  Bu,u,  sin  6  y/x  30  /30 

yy  xyxy  y  y  xy 


(422) 

(423) 

(424) 

(425) 

(426) 

(427) 


where  Byy  is  given  by  equation  (14)  and  where 

0  =  0U,  +  3,„u/  ~  ®  “3 

vy  y  yy  xy  xyxy 


(428) 


tan  6  =  X  36  /3x 

xyxy  y  xy  y 


The  component  parts  of  the  Dirac  equation  (422)  can  be  written  as 

-  ifiy  v^  cos  9  +  my  cos  6,,,  =  0 

y  vy  y 

-  i^Y  v^  sin  9  +  my  sin  9,,,  =  0 

y  vy  y 


(430) 

(431) 


where  the  mass  m  is  taken  to  be  a  scalar.  An  approximate  representation  of 
equation  (422)  follows  from  the  assumption  that  the  phase  angles  of  each  term 
in  equation  (422)  are  equal,  with  the  result  that  the  Dirac  equation  is  written 
as 


-  i^Y  v^  +  my  =  0 
y 

6  =  Qu, 

vy  y 


(432) 

(433) 


Equations  (428)  and  (433)  give  for  this  approximation 
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0 


(434) 


g  -  e  -  B  - 

ft  xw  '^XMxy 

Equation  (434)  is  a  differential  equation  relating  e«p  and  as  can  be  seen 
from  equations  (14)  and  (429).  For  the  case  of  a  slow  quantum  process  (inco¬ 
herent  wave  function)  in  incoherent  spacetime,  all  internal  phase  angles  are 
set  to  zero  and  equations  (420),  (424)  and  (432)  reduce  to  the  standard  Dirac 
equation 


-  ifiy  3y/3x  +  ml'  -  0 

U  M 


(435) 


where  4*  is  the  standard  Dirac  four  component  spinor. 

For  coherent  spacetime  coordinate  variation  and  an  arbitrary  variation  of 
the  wave  function,  equation  (420)  becomes  the  Dirac  equation 


ij/i/x  Y  3?/36  +  m?  -  0 

U  P  xp 


(436) 


For  the  special  case  of  an  ultrafast  quantum  process  in  coherent  spacetime  equa¬ 
tions  (420)  or  (436)  give  the  following  Dirac  equation 


(-  ift/x  Y  36u</30  +  m)*!*  “  0 

p  P  S'  xp 


(437) 


Equation  (437)  is  a  matrix  equation  where 


and  where 


0  6. 


(438) 


(439) 


36^/36 
4*  xp 


4*1  xp 


4*2  xp 


3e^-/30 
4*3  xp 


30^,/3e 
4'4  xp 


(440) 


571 


wHere  w  *  0,1, 2, 3  .  The  Dirac  equation  (437)  for  an  ultrafast  quantum  process 
In  coherent  spacetime  can  also  be  deduced  from  equations  (422),  (427)  and  (428) 
by  setting  «  it/2  and  Bxpxy  ■  %#hlch  gives 


-  y/x  90  /9e 

-  y/x  90^/90 

u  y  XU 


0  -  Ou/  -  6 

vu  y  XU 


(441) 

(442) 

(443) 


The  real  and  Imaginary  parts  of  equation  (437)  can  be  written  as  the  following 
matrix  equations 

[-  ift/x  Y  00ui/30  cos(Ou,  -  0  )  +  m  cos  0iu]y  *  0  (444) 

u  u  y  XU  y  XU  y 

[-  ifi/x  Y  36u,/3e  Sln(0^  -  0  )  +  m  sin  0^]y  -  0  (445) 

'•  u  u  y  XU  y  XU  y 

where  cos(04«  -  Qxy)  »  sln(0Y  -  0xy)  »  cos  0\j(  and  sin  Oy  are  4x4  diagonal 
matrices. 

Approximate  matrix  equations  that  result  from  the  assumption  that  0 

in  equations  (444)  and  (445)  are  the  following 


(-  iR/x^  Y^^  ^®y/^®xu  ° 


(446) 

(447) 


(-  i^^/x  Y  +  m)y  0  (447) 

u  u  y  XU  I 

where  the  column  matrices  y^  and  yj  are  given  by  the  following  matrix  equations 


y^^  =  cos  y  = 


y^  =  sin  6^  y  = 


cos 

ft 

yi 

"2 

cos 

®y2 

^3 

cos 

®y3 

cos 

®4-4 

sin 

Si 

^2 

sin 

®y2 

"3 

sin 

®y3 

sin 

®y4 

(448) 


(449) 


Choose  the  following  linear  solution  for  the  diagonal  matrix  6^ 

0  =  y  b’*'© 

y  u  XU 


(450) 
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where  are  four  constant  diagonal  matrices  given  by 


b'*'^  0  0  0 

0  b***^  0  0 

y 

0  0  b'^^  0 

y 

y/ 

0  0  0  b^ 


(451) 


where  y  «  0,1, 2, 3  ;  so  that  b^  represents  sixteen  quantities.  The  component 
representation  of  the  matrix  equation  (450)  is  written  as 


=  I  b^'^e 

Vv  “  y  xy 


where 


b  =  30^  /ae 

y  "Vv  xy 


(452) 


(453) 


for  y  ^  0,1, 2, 3  and  v  0,1, 2, 3  .  Then  Dirac’s  equations  (437),  (446)  and  (447) 

can  be  written  as 


(-  ih/x  y  b***  +  m)?  =  0 

y  y  y 


(454) 


(-  iR/x  Y  b  +  m)4'_  'v^  0 
y  y  y  R 

(-  ih/x  y  b'*'  +  m)4'  0 

y  y  y  I 


(455) 


(456) 


which  are  valid  for  ultrafast  relativistic  quantum  processes  in  coherent  space- 
time. 


Finally  it  should  be  pointed  out  that  from  equations  (422)  and  (423)  it 
follows  that  the  Dirac  equation  for  an  ultrafast  quantum  process  in  Incoherent 
spacetime  is  given  by 


(-  ij^Y^  9®y/9*y  +  in)t  =  0 


(457) 


where  aSy/ax^  are  four  diagonal  matrices. 


aeyi/3x^ 


90y/3x^  = 


ae,3/3Xy 


(458) 


9e^^/ax^ 
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for  ii  *  Of  1,2 ,3  .  The  real  and  imaginary  parts  of  equation  (457)  are  given  by 
(ifiy^  3e^/3x^  sin  6^  +  m  cos  -  0  (459) 

(-  iRy^  cos  +  m  sin  e^)f  -  0  (460) 

where  cos  6)^  and  sin  6y  are  diagonal  matrices  in  analogy  to  equation  (439) . 

5.  CONCLUSION.  Quantum  mechanics  can  be  formulated  for  the  case  where 
the  wave  function  and  the  space  and  time  coordinates  have  broken  Internal  sym¬ 
metries  which  require  that  these  quantities  be  written  as  complex  numbers  In  an 
internal  space.  This  allows  the  possibility  of  slow  and  ultrafast  quantum  pro¬ 
cesses  occurring  In  spacetime  with  coherent  and  Incoherent  variation  of  coor¬ 
dinates.  For  an  ultrafast  quantum  process  Che  complex  number  wave  function  ro¬ 
tates  in  an  internal  space,  while  for  a  slow  quantum  process  the  wave  function 
changes  in  magnitude.  Spacetime  coordinates  can  likewise  change  coherently  by 
a  rotation  in  an  internal  space  or  incoherently  by  a  change  in  Che  magnitudes 
of  the  space  and  time  coordinates.  The  Dirac  equation  and  the  Schrbdinger  time 
dependent  and  time  Independent  equations  are  formulated  for  slow  and  ultrafast 
quantum  processes  that  can  occur  in  coherent  or  incoherent  space  and  time. 
Ultrafast  processes  are  Important  to  science  and  engineering  and  may  possibly 
be  the  basis  of  developing  energy  sources  such  as  the  nuclear  rocket  engine. 
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QUATERNARY  FISSION  OF  y  RAY  COOLED  ACTINIDES 


Richard  A.  Weiss 

U.  S.  Amy  Engineer  Uetenreys  Experinenc  Station 
Vicksburg,  Mississippi  39180 


ABSTRACT.  The  basic  concepts  needed  for  the  design  of  clean  fission 
nuclear  reactors  that  use  actinide  eleaents  such  as  and  ^*Pu  as  fuels  are 
developed  in  this  paper.  In  the  presence  of  an  electromagnetic  field  all  nuclei 
exhibit  a  broken  internal  symmetry  that  can  be  described  by  requiring  the  atomic 
number  and  atomic  mass  number  to  be  complex  niaibers  %rhose  magnit\ides  are 
functions  of  the  internal  phase  angles  of  the  complex  numbers.  The  fom  of  these 
functions  depends  on  the  value  of  the  fissility  parameter  of  a  nucleus.  For 
sxibactinide  nuclei  with  fissility  parameters  less  than  unity  the  measured  atomic 
number  and  measured  atomic  mass  ntudser  are  not  exactly  integers,  tdiile  for 
actinide  nuclei  whose  fissility  parameters  are  greater  than  unity  the  measured 
atomic  number  and  atomic  mass  nusber  must  be  integers.  This  basic  difference  in 
the  countability  of  nucleons  in  stibactinide  nuclei  and  actinide  nuclei  yields  two 
different  representations  of  the  complex  n\Jid>er  form  of  the  liquid  drop  type  of 
nuclear  mass  formula.  For  actinide  nuclei  in  an  external  electromagnetic  field 
the  complex  number  Bohr -Wheeler  fission  condition  predicts  that  above  a  critical 
7  ray  Intensity,  that  is  determined  by  the  value  of  the  fissility  parameter,  the 
thermal  neutron  induced  binary  fission  node  is  suppressed.  The  alternate  node 
of  quateniary  fission  due  to  the  thermal  neutron  induced  binary  fission  of  the 
two  copqionent  subactinide  lobes  of  an  actinide  nucleus  into  four  fission  product 
nuclei  is  catalyzed  by  the  presence  of  a  7  ray  field.  A  mathematical  expression 
is  derived  in  terms  of  the  fissility  parameter  that  gives  the  value  of  the 
critical  electromagnetic  field  strength  of  the  external  7  ray  field  required  to 
cool  and  suppress  thermal  neutron  induced  binary  fission  of  the  actinias.  The 
nuclear  fission  waste  products  of  the  7  ray  catalyzed  thermal  neutron  induced 
quaternary  fission  of  the  actinide  elements  will  be  low  level  radionuclides,  and 
represents  a  way  of  generating  clean  fission  nuclear  power.  Clean  fission 
nuclear  reactor  designs  are  considered  that  may  be  used  for  submarines  and 
nuclear  rocket  engines. 

1 .  INTRODUCTION .  Nuclear  fission  reactors  may  yet  be  the  power  source 
that  will  drive  the  industrial  nations  in  the  future.  At  present,  however, 
nuclear  reactors  are  a  potentially  dangerous  way  of  generating  power.  This  is 
due  first  to  the  possibility  of  nuclear  accidents  which  can  spread  radioactive 
fission  products  such  as  ^’’Sr  into  the  environment,  and  secondly  to  the  growing 
problem  of  the  storage  of  radioactive  wastes. Power  generated  from  nuclear 
fission  was  immediately  recognized  as  an  alternative  to  the  energy  generated  by 
burning  fossil  fuels. The  use  of  fossil  fuels  causes  air  pollution  which 
results  in  acid  rain,  various  human  diseases,  and  may  even  contribute  to  global 
warming. Nuclear  fission  power  is  used  in  countries  like  Japan  and  France 
to  generate  a  significant  portion  of  their  energy  needs  because  these  countries 
have  no  natural  fossil  fuel  resources  and  are  therefore  forced  by  economic 
necessity  to  utilize  nuclear  fission  reactors  for  generating  electricity.  No 
major  nuclear  accident  has  occurred  in  these  countries,  but  serious  nuclear  acci- 
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dents  have  occurred  in  the  United  States  and  Ukraine.  Therefore  scientists  and 
engineers  are  now  looking  for  alternative  energy  sources  which  are  safe  and  non¬ 
polluting. 

But  what  are  the  alternatives  to  fossil  fuels  and  conventional  nuclear 
fission  power?  A  glance  at  history  shows  that  people  have  always  used  their 
minds  to  bend  nature  to  their  advantage. And  today  the  search  for  new  en¬ 
ergy  sources  continues  in  various  directions  such  as  solar  power,  wind  power, 
geothermal  energy,  biomass  energy  and  nuclear  fusion  power.  These  new  en¬ 

ergy  sources  are  nowhere  near  the  point  of  scientific  and  engineering  develop¬ 
ment  where  they  can  be  used  as  alternative  power  sources  to  fossil  fuels  or  nu¬ 
clear  fission  power.  Nuclear  fusion,  the  power  source  of  the  stars,  remains 
etherlal  after  many  years  of  research  and  may  ultimately  be  a  chimera.  In  view 
of  this  it  may  be  wise  to  take  a  second  look  at  nuclear  fission  power  to  see  if 
new  concepts  can  be  developed  which  will  allow  nuclear  reactors  to  be  designed 
for  safe  and  environmentally  benign  operation.  A  clean  fission  nuclear  reactor 
core  design  concept  has  been  proposed  that  uses  y  ray  catalyzed  thermal  neutron 
induced  binary  fission  of  subactinide  elements. In  the  present  paper  a  y  ray 
catalyzed  thermal  neutron  Induced  quaternary  fission  nuclear  reactor  is  inves¬ 
tigated  that  uses  actinide  elements  as  a  source  of  clean  fission  power.  In  this 
case  the  y  rays  are  used  to  suppress  binary  fission  and  catalyze  quaternary  fis¬ 
sion  in  the  actinides  whose  reaction  product  nuclei  will  be  relatively  light 
weight  nuclei  having  only  low  level  beta  decays. 

The  use  of  y  rays  for  the  suppression  of  binary  fission  and  the  enhance¬ 
ment  of  quaternary  fission  in  the  actinides  is  possible  because  atomic  nuclei 
are  systems  of  matter  that  can  exhibit  broken  internal  symmetries  which  are  ul¬ 
timately  related  to  the  broken  internal  symmetries  of  spacetime.^’  The  broken 
symmetries  of  spacetime  are  described  by  representing  time  and  space  coordinates 
as  complex  numbers  in  the  following  way^^ 

t  =  t  exp(j0^)  (1) 

and  for  cartesian  coordinates 

X  =  X  exp(j0^)  y  =  y  exp(j0y)  z  =  z  exp(je^)  (2) 

while  for  spherical  polar  coordinates 

r  =  r  exp(j0^)  ^  =  4'  exp(j0^)  4'  =  ij'  exp(j0^)  (3) 

Corresponding  to  the  complex  number  azimuthal  angle  is  the  following  complex 

magnetic  quantum  number^ 

M  =  M  exp(i0u)  =  m  cos  0.  exp(-j6.)  (4) 

n 

where  m  =  0  ,  ±1  ,  i2  ,  ±3,  •••  is  the  standard  magnetic  quantum  number .  Therefore^’ 

M  =  m  cos  6.  9m  “"Qa  (5) 

<p  ri  <p 
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It  is  often  convenient  to  define  the  following  positive  magnetic  quantum  nuaber^^ 


M'  -  M*  exp(j0^)  - 

|m|  cos  0^  exp(-je^) 

(6) 

M'  -  |m|  cos  6. 

e'  “  -  0 

M  ♦ 

(7) 

For  6^-0  these  quantities  reduce  to  the  standard  concepts. 

It  is  assumed  that  the  integer  quantum  numbers  Z,  N  and  A  are  analogous  to 
the  magnetic  quantum  number  |m|  and  occur  in  a  solution  to  an  azimuthal  portion 
of  a  wave  equation  in  internal  space. Then  by  an  arqument  similar  to  that  for 
the  complex  number  magnetic  quantum  number  it  follows  that  the  atomic  number, 
neutron  number  and  atomic  mass  number  are  complex  numbers  in  an  internal  space 
and  can  be  written  in  a  form  similar  to  that  of  the  complex  magnetic  quantum 
number  M'  in  equation  (6)  as  follows  for  subactinide  nuclei  located  in  an  exter¬ 
nal  field^® 


z  =  z  exp(j0  )  =  Z  cos  0  exp(j0  )  (8) 

z  z  z 

n  =  n  exp(j0^)  =  N  cos  0^  exp(j0^)  (9) 

a  =  a  exp(j0  )  =  A  cos  0  exp(j0  )  (10) 

3i  d  d 

The  complex  number  azimuthal  angles  corresponding  to  the  quantum  numbers  z  ,  n 
and  a  are  written,  in  analogy  to  the  azimuthal  angle  of  real  space  given  in 
equation  (3) ,  as  follows 


♦z  =  *z 


♦n  =  *^n 


where  0^^  »  ®(tia  internal  phase  angles  of  the  complex  number 

azimuthal  angles  in  the  internal  space  of  nucleons,  so  that  finally  for  subac¬ 


tinide  nuclei  in  an  external  field' 


z  *  Z  cos  0 


n  «  N  cos  0 


a  =  A  cos  0 


6  =  -  0A 

z  <pz 


9  =  -  6a 
n  <|)n 


6  =  -  6a 

a  <t)a 


The  real  and  imaginary  parts  of  equations  (8)  through  (10)  are  given  for  sub¬ 
actinide  nuclei  in  an  external  field  by^® 

z„  =  Z  cos^0  =  Z  cos^0.  (lA) 

R  z  <pz 

2  2 

n„  =  N  cos  0  =  N  cos  0.  (15) 

R  n  <pn 

2  2 

a„  =  A  cos  0  =  A  cos  0.  (16) 

R  a  (fra 
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*I 

-  2 

cos 

6 

z 

sin 

0  - 

z 

-  Z 

cos 

sin 

(17) 

"l 

-  N 

cos 

e 

n 

sin 

0  - 

n 

-  N 

cos 

0. 

#n 

sin 

®(tin 

(18) 

-  A 

cos 

6 

a 

sin 

0  - 

a 

-  A 

cos 

^a 

sin 

%a 

(19) 

The  measured  atomic  number,  neutron  number  and  atomic  mass  number  are  Just  the 
real  values  given  by  equations  (14)  through  (16).  Because  the  measured  atomic 
number,  neutron  number  and  atomic  mass  number  for  subactinide  nuclei  in  an  ex¬ 
ternal  field  are  given  by  equations  (14)  through  (16)  it  follows  that  these 
quantities  are  nondenumerable  (non- Integer)  for  the  subactinides.  It  will  be 
shown  further  on  in  this  paper  that  this  is  not  the  case  for  actinide  nuclei  in 
an  external  field  where  now  the  measured  atomic  number,  neutron  number  and  atom¬ 
ic  mass  number  are  denumerable  (Integers). 

The  conventional  liquid  drop  model  of  the  nuclear  binding  energy  is  well 
described  in  the  literature. In  the  presence  of  an  electromagnetic  field 
the  complex  number  binding  energy  of  an  atomic  nucleus  is  written  as^^ 

B  =  E  -  E  -  E  -  E  +E  .  +E.,, 

V  s  c  sym  pair  shell  (^0) 

—2/3  7-2, -1/3  -X,-  -v2,-  ^  =  .  = 

=  aa  -  ya  -  6z  /a  -  6(n  -z)/a+E  .  +E.t, 

'  pair  shell 

where  for  subactinide  nuclei  z  ,  n  and  a  are  given  by  equations  (8)  through  (10) 
respectively,  Ey  .  Es  »  »  Es>Tn  •  %air  Eghell  “  complex  number  volume, 

surface,  Coulomb,  symmetry,  pairing  and  shell  energies  respectively,  and  where 
ci  ,  y  ,  6  and  6  =  complex  number  volume,  surface.  Coulomb  and  symmetry  energy 
coefficients  respectively.  The  mass  formula  coefficients  are  represented  as 


a  =  a  exp(j0^) 

y  =  y  exp(j9^) 

(21) 

6  =  6 

;  exp(j9^) 

6  =  6  exp(j9g) 

(22) 

E  .  =  E  .  exp(j9_  ,  ) 

pair  pair  Epair 

t,-/-3/4 
=  Pp/a 

^shell  "  ^shell 

(23) 

where 

p  =  p 

1  exp(j6p) 

\  Z  even,  A  even 

p  a,  34  MeV 

(24) 

P  « 

0  A  odd 

(25) 

^-1  Z  odd,  A  even 

and  where  for  subactinide  nuclei  equation  (12)  and  (23)  give 
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(26) 


E  ,  -  PpA-3/‘  co.-3/^ 

pair  a 


-  e  -  3/40 
Epalr  p  a 


and  where  the  shell  energy  has  a  more  complicated  variation.  For  the  subactinide 
elements  equations  (8)  through  (10)  and  (20)  through  (26)  give^® 

B  -  oA  cos  0  exp[j(6  +6^)1  -  cos^^^O^  exp[j(0^  +  2/30^)]  (27) 

cl  Q  a  a  y  3 

-  cos^e  cos"^^^e  expij(e  +  20  -  1/30  )i 

Z  3  o  Z  3 

-  explj(0g  +  28^  -  6^)] 

-3/4  -3/4 

+  PpA  '  cos  '  exp[j(ep  -  3/40^)] 

^shell  “P‘J«Eshell> 

where  the  complex  number  neutron  excess  is  given  by^® 

C  =  C  exp(j0^)  =  n  -  z  (28) 

The  measured  binding  energy  is  given  by  the  real  part  of  equation  (27)  so  that^® 

B  =  a  A  -  Y  A^''^  -  6  +  E®  +  E™  (29) 

m  m  m  m  m  pair  shell 

where  for  subactinide  nuclei 

a  =  a  cos  6  cos(0  +  0  )  (30) 

m  a  a  a 

Y  =  Y  cos^^^0  cos(0  +  2/30  )  (31) 

m  a  Y  a 

6=6  cos^0  cos"^^^0  cos(0,  +  20  -  1/30  )  (32) 

m  z  a'6za' 

6'  =  B  cos  ^0  cos(9„  +  20-  -  0  )  (33) 

m  a  B  5  a 

e“  .  =  PpA”^'^^  cos~^'^^0  cos(0  -  3/40  )  (34) 

pair  a  p  a 

The  calculation  of  the  y  ray  flux  that  is  required  to  catalyze  thermal 
neutron  induced  fission  in  the  subactinide  elements  using  thermal  neutrons  has 
already  appeared  in  the  literature.^®  This  calculation  is  based  on  the  deter¬ 
mination  of  the  fission  angle  0|  of  the  atomic  number  which  proceeds  as  follows. 
The  complex  number  generalization  of  the  Bohr-Wheeler  fission  instability  con¬ 
dition  is  given  by^ 

z^/a  =  ky/6  (35) 


or  equivalently  the  two  scalar  fission  stability  boundary  conditions  are 
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2^/a  -  Ky/6 


(36) 


0  -  20  -  0  +0.  (37) 

a  z  Y  u 

Combining  equation  (12)  with  equations  (36)  and  (37)  gives  the  fission  insta¬ 
bility  condition  for  subactinide  nuclei  located  in  an  external  field  as^^ 

Z^/A  ■  ky/6  cos  0  cos  ^0  (38) 

a  z 

_2 

“  ky/6  cos(26  +0.-0  )cos  0 
2  0  Y  2 

F 

Equation  (38)  can  be  solved  to  determine  the  value  of  required  to  fission 
subactinide  nuclei  with  thermal  neutrons  and  the  result  is^ 

F  1/2 

tan  0^  =  tan(0^  -  0^)  ±  sec(0^  -  6g)[l  “  X  cos(0^  -  6^)1  (39) 

where  the  fissility  parameter  x  is  given  by^® 

X  =  (Z^/A)(ky/6)"^  (40) 


and  is  restricted  to  the  range  0  <  x  ^  sec(6Y  -  65)  by  equation  (39)  and  de¬ 
scribes  most  subactinide  nuclei  except  for  some  exotic  proton  rich  subactinide 
nuclei  for  which  X  falls  outside  of  this  range.  The  value  of  the  fission  angle 
6^  of  the  atomic  mass  number  that  is  required  for  the  fission  of  subactinide 
nuclei  by  thermal  neutrons  is  obtained  from  equation  (37),  as^® 

0*^  =  20^  -  6  +  0.  (40A) 

a  z  Y  6 


F  F 

Figure  1  gives  0^  and  Figure  2  gives  03  for  subactinide  nuclei  having 
0  <  X  ^  sec(0Y  -  05)  and  using  the  values  of  the  angles  6y  =  0.4r  and  ©5  =  O.lr 
whose  values  are  selected  for  heuristic  purposes.  On  the  other  hand,  the  as¬ 
sumption  that  By  =  0(5  allows  equations  (39)  and  (40A)  to  be  written  as^^ 

tan  0^  'V,  ±  (1  -  0^  'V'  20^  0^  -v  30^  (41) 

z  ^  az  nz 

F  F 

which  are  valid  for  0  <  X  ^  1  •  Figure  3  gives  Figure  4  gives  0a  for 

subactinide  nuclei  under  the  approximation  that  0,^  =  05  . 


The  magnetic  induction  field  of  the  y  rays  that  are  required  for  the  ca¬ 
talysis  of  thermal  neutron  Induced  fission  of  subactinide  nuclei  is  then  given 
by2® 


By 

02 


tan 


z 


-  K®ni  - 

Oz 


(42) 


where 


dynamic  magnetic  stiffness  modulus. 


Values  of 


for  various  sub- 
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actinide  nuclei  have  been  tabulated  in  the  literature. This  review  of  the 
theory  of  y  ray  catalyzed  fission  of  subactinide  nuclei  by  thermal  neutrons  is 
sufficient  to  lead  into  the  problem  of  quaternary  fission  of  Y  ray  cooled  ac¬ 
tinide  nuclei  which  is  treated  in  the  following  sections. 

This  paper  considers  the  thermal  neutron  induced  quaternary  fission  of 

Y  ray  cooled  actinide  nuclei.  It  will  be  shown  that  the  atomic  number,  neutron 
number  and  atomic  mass  number  of  the  actinide  elements  in  an  external  field 
must  be  represented  by  special  kinds  of  complex  numbers  in  internal  space  such 
that  the  measured  atomic  number,  neutron  number  and  atomic  mass  number  are  in¬ 
tegers.  These  representations  suggest  that  it  is  possible  to  suppress  binary 
fission  by  thermal  neutrons  in  the  actinides  by  applying  a  y  ray  field  having 
the  proper  frequency  and  intensity.  The  result  will  be  an  enhanced  quaternary 
fission  rate  in  the  actinides  with  fission  products  that  are  relatively  low 
level  radionuclides.  Therefore  clean  fission  of  the  actinide  elements  using 
thermal  neutrons  is  possible  by  utilizing  the  cooling  effects  of  a  properly 
chosen  y  ray  bath.  An  outline  of  this  paper  is  as  follows:  Section  2  gives  the 
special  forms  of  the  complex  atomic  number,  neutron  number  and  atomic  mass  num¬ 
ber  for  the  actinide  elements  in  an  external  field.  Section  3  investigates  the 
theory  of  the  radioactive  decay  of  actinide  nuclei  in  an  external  electromag¬ 
netic  field.  Section  4  develops  a  liquid  drop  type  of  nuclear  mass  formula  for 
actinide  nuclei  in  an  external  field.  Section  5  presents  a  theory  of  the  sup¬ 
pression  of  thermal  neutron  induced  binary  fission  of  fissile  actinide  nuclei 
by  an  external  field.  Section  6  introduces  the  concept  of  thermal  neutron  in¬ 
duced  quaternary  fission  in  y  ray  cooled  actinide  nuclei  and  gives  the  required 

Y  ray  photon  energy,  number  density  and  flux  density  for  binary  fission  sup¬ 
pression,  and  finally  Section  7  gives  the  final  state  energy  conditions  for  the 
binary  fission  of  actinide  nuclei  in  a  low  intensity  y  ray  field. 

2.  ACTINIDE  NUCLEI  WITH  BROKEN  INTERNAL  SYMMETRIES.  This  section  con¬ 
siders  the  broken  internal  symmetries  of  the  atomic  number,  neutron  number  and 
atomic  mass  number  of  actinide  nuclei  that  are  subject  to  an  external  electro¬ 
magnetic  or  gravitational  field. 

A.  Complex  Atomic  Number,  Neutron  Number  and  Atomic  Mass 
Number  for  the  Actinide  Elements  in  an  External  Field. 

For  those  actinide  nuclei  which  exhibit  spontaneous  or  thermal  neutron 
induced  binary  fission  it  is  assumed  that  the  component  nucleons  are  in  a  mea¬ 
surably  denumerable  state  so  that  the  measured  atomic  number,  neutron  number 
and  atomic  mass  number  are  integers  even  in  the  presence  of  an  external  field. 
This  suggests  that  in  the  presence  of  an  external  field  the  atomic  number, 
neutron  number  and  atomic  mass  number  of  actinide  nuclei  are  complex  numbers  of 
the  form 


z  = 

z 

exp(j0^) 

=  z 

sec 

e 

z 

exp(j0^) 

(43) 

n  = 

n 

exp(je^) 

=  N 

sec 

e 

n 

exp(j0^) 

(44) 

a  = 

a 

exp(je  ) 

ol 

=  A 

sec 

0 

a 

exp(j0^) 

(45) 
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and  therefore  for  the  actinides 

z  •  Z  sec  e  n  ■  N  sec  0  a  =  A  sec  0 

z  n  a 

It  is  convenient  to  Introduce  the  following  terms 

I  *  sec  0  exp(j0  ) 

=  sec  0^  exp(j0^) 

1  “  sec  0  exp(j0  ) 

a  a  a 

Then  equations  (43)  through  (45)  can  be  written  as 

z-zl  n-Nl  i>=Al 

z  n  a 


(46) 

(47) 

(48) 

(49) 

(50) 


The  real  and  Imaginary  parts  of  equations  (43)  through  (45)  are  written  as 


Zr  =  Z  "r  “  N 

=  Z  tan  0  n,.  =  N  tan  0  a^  =  A  tan  0  (52) 

I  z  I  n  I  a 


The  measured  values  of  the  atomic  number,  neutron  number  and  atomic  mass  num¬ 
ber  are  just  the  corresponding  real  values  so  that 

z=Z  n=N  a=A  (53) 

m  m  m 


and  therefore  the  measured  values  of  the  atomic  number,  neutron  number  and 
atomic  mass  number  for  the  actinides  with  X  ^  1  in  an  external  field  are  de¬ 
numerable  (integers) .  This  is  ’ot  the  case  for  subactinide  nuclei  with  x  ^  1 
in  an  external  field  as  seen  in  equations  (14)  through  (16). 

The  law  of  addition  for  the  complex  atomic  number,  complex  neutron  number 
and  complex  atomic  mass  number  is  given  for  both  subactinide  and  actinide  ele¬ 
ments  in  an  external  field  as^® 

W  +  a  =  z  +  n  (54) 

subject  to  the  universal  law  of  baryon  number  conservation 

A  =  Z  +  N  (55) 


The  known  quantities  in  equation  (54)  are  taken  to  be  Z  ,  Sg  ,  N  and  0^  •  The 
value  of  A  is  determined  from  equation  (55)  while  the  unknown  quantities  W 
and  0a  are  obtained  by  taking  the  real  and  imaginary  parts  of  equation  (54)  and 
using  equations  (51)  and  (52)  with  the  result  that  for  actinide  nuclei 
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W  +  A  -  Z  +  N 


(56) 


A  tan  6  =  Z  tan  0  +  N  tan  6  (57) 

a  z  n 

Therefore  for  actinide  nuclei  in  an  external  field 

W  »  0  (58) 

tan  6  =  Z/A  tan  0  +  N/A  tan  0  (59) 

a  z  n 

=  tan  0  +  Z/A(tan  0  -  tan  6  ) 

n  z  n 

Equation  (59)  can  also  be  written  as 

tan  0  =  (A  tan  6  -  Z  tan  0  )/(A  -  Z)  (60) 

n  a  z 

tan  6  *  (A  tan  0  -  N  tan  0  )/Z  (61) 

z  an 

which  are  valid  for  actinide  nuclei.  Combining  equations  (54)  and  (58)  gives 

a  =  z  +  n  (62) 

which  is  valid  only  for  actinide  nuclei  because  in  this  case  the  measured  nu¬ 
cleon  numbers  are  denumerable  as  shown  in  equation  (51),  so  that  W  =  0  for  ac¬ 
tinide  nuclei  as  shown  in  equation  (58).  If  ®n  then  equation  (59)  shows 

chat  the  following  condition  is  valid  for  the  actr'.nides 

0  -v  0  %  0  (63) 

a  z  n 

If  0a  -v-  20Z  for  actinide  nuclei  then  the  relationship  between  6z  and  0n  is  ob¬ 
tained  from  equation  (59)  to  be 

tan (20  )  -  Z/A  tan  0  =  N/A  tan  0„  (64) 

z  z  n 

The  expressions  for  W  and  0a  for  subactinide  nuclei  already  appear  in  the  lit¬ 
erature.^®  For  subactinide  nuclei  the  measured  nucleon  numbers  are  nondenumer 
able  as  shown  in  equations  (14)  through  (15)  and  V  ^  0 

B.  Time  Variation  of  the  Complex  Atomic  Number,  Neutron  Number 
and  Atomic  Mass  Number  for  Actinide  Nuclei. 

This  section  considers  the  time  variation  of  z  ,  n  and  i  for  actinide 
nuclei.  Two  special  cases  are  considered:  the  first  is  associated  with  changes 
in  Z  ,  N  and  A  due  to  nuclear  fission  and  other  radioactive  decays  in  a  con¬ 
stant  external  field,  and  the  second  is  associated  with  the  variations  of  02  , 
0n  and  0a  that  are  related  to  a  time  varying  external  field  for_flxed_values 
of  Z  ,  N  and  A  .  The  general  case  of  the  time  variation  of  z  ,  n  and  a  is  ob¬ 
tained  from  equations  (43)  through  (45)  as^® 
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dz/dt  -  cos  6^^  (dz/dt  +  jzd0^/dt)exp( j (0^  “  ®t  “ 

dn/dt  -  cos  (dn/dt  +  jnd0^/dt)expt j (0^  "  ®t  “ 

da/dt  -  cos  6^^  (da/dt  +  Jad0^/dt)exp( j (0^  “  ®t  “ 

where  3^^  is  given  by^’ 

tan  3j.^  -  t30j./3t  (68) 

The  internal  phase  angles  02  ,  Bn  and  0a  are  functions  of  the  external  field, 
such  as  an  electromagnetic  field,  so  that 

0,  -  0,(Z,H)  -  0„(N,H)  0^  -  e^(A.H)  (69) 

z  z  on  o  a 


where  H  ■>  amplitude  of  the  magnetic  field  component  of  an  electromagnetic  field. 

The  time  variation  of  the  internal  phase  angles  are  written  as 

d0  /dt  =  30  /3Z  dZ/dt  +  30  /3H  dH/dt  (70) 

z  z  z 

de  /dt  =  30  /3N  dN/dt  +  30  /3H  dH/dt  (71) 

n  n  n 

d0  /dt  =  30  /3A  dA/dt  +  30  /3H  dH/dt  (72) 

a  a  a 


Combining  equations  (46)  with  equations  (65)  through  (72)  gives  for  the  actinides 


dz/dt  =  cos  3,,  (B’dZ/dt  +  C’dH/dt)exp[ j (0  -  0  -  8^^)]  (73) 

tcz  z  zttt 

dn/dt  =  cos  8  (B'dN/dt  +  C'dH/dt)exp[ j (0  -  6.  -  B  )]  (74) 

Cl  C  Tl  C  C  C 

da/dt  =  cos  8..  (B'dA/dt  +  C'dH/dt)expt j (6  -  0.  -  8^^)]  (75) 

CC  d  3  3  C  uC 

where 

B'  =  sec  0  [1  +  (tan  0  +  j)Z30  /3Z]  (76) 

z  z  z  z 

B'  =  sec  0  [1  +  (tan  0  +  j)N30  /3N]  (77) 

n  n  n-’  n 

B'  =  sec  0  [1  +  (tan  0  +  j)A30  /3A]  (78) 

3  3  3  3 

=  sec  0^  (tan  0^  +  j)Z30^/3H  (79) 

C'  »  sec  0  (tan  0  +  j)N30  /3H  (80) 

n  n  n  n 

C'  »  sec  0  (tan  0  +  j)A30  /3H  (81) 

3  3  3  3 

where 
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(82) 


tan  0  +  j  ■  sec  0  exp(j(Tr/2  -  0  )] 

z  z  z 

tan  +  j  *  sec  0^  exp[j(Tf/2  -  0^)1  (83) 

tan  0^  +  j  -  sec  0  exp[j(Tf/2  -  0  )]  (84) 

Two  special  cases  need  to  be  considered. 

Case  a.  Actinide  Nuclei  In  a  Constant  Magnetic  Field. 

This  case  corresponds  to  nuclear  reactions  such  as  the  fission  of  the  ac¬ 
tinides  in  a  constant  magnetic  field  and  equations  (73)  through  (75)  become 

(dz/dt)y  =  cos  BMZ/dt  exp[j(0^  ~  ®t  ~  ^tt^^ 

(dn/dt)jj  =  cos  BMN/dt  exp[j(e^  “  ®t  “  ®tt^^ 

(da/dt)jj  =  cos  Bj.j.  BMA/dt  exp[j(e^  “  ®t  ~  ^tt^^ 

For  this  case  the  actual  size  of  the  nucleus  is  changing. 

Case  b.  Actinides  with  Constant  Z,  N  and  A  . 

This  case  corresponds  to  changes  in  the  internal  phase  angles  ,  0n  and 
6a  in  a  time  varying  electromagnetic  field.  Then  equations  (73)  through  (75)  give 

(dz/dt)^  =  cos  Bj.^  CMH/dt  exp[j(0^  “  ®t  ~  ^tt^^ 

(dn/dt)j^  =  cos  Bj.^  CMH/dt  exp[j(e^  “  ®t  “  ^tt^^ 

(da/dt)  =  cos  6^^  C'dH/dt  exp[j(e  -  6^  -  B^^)]  (90) 

A  tt  a  a  t  tt 

Case  b  corresponds  to  nuclei  gaining  or  losing  energy  by  internal  space  rota¬ 
tions  . 

The  time  derivatives  of  z  ,  n  and  a  given  in  equations  (73)  through  (75) 
can  also  be  written  in  a  more  general  form  involving  only  one  exponential  term 
by  noting  that  equations  (43)  through  (45)  give 

dz/dt  =  cos  6  sec  B  dz/dt  exp(i4>  )  (91) 

tt  zz  zt 

=  cos  B  CSC  B  zd0  /dt  exp(j4»  ^) 
tt  zz  z  zt 

=  sin  B.,.,  sec  B  t  ^dz/d0  exp(j4>  ) 

tt  ZZ  t  zt 

=  sin  B^..  CSC  B  z/t  d6  /d6^  exp(j4'  .) 
tt  zz  z  t  zt 
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(92) 


dn/dt  ■  cos  6^.  sec  3  dn/dt  exp(j4 

tt  nn  nc 

-  cos  CSC  nde^/dt  exp(j4>^^) 

-  sin  B^^  sec  B„„  t~^dn/de.  exp(j*  ) 

cc  nn  c  nc 

-  sin  B^^  CSC  B„„  n/t  dd  /dd^  exp(j*  ) 

cc  nn  n  c  nc 

da/dc  “  cos  B^^  sec  B  da/dt  exp(j*  )  (93) 

Cu  dd  Aw 

-  cos  Bj.^  esc  B^g  ad0^/dc  exp(j4'^^) 

=  sin  B^^  sec  B  t  ^da/de^  exp(j4'  ) 
tt  aa  t  at 

»  sin  B^^  CSC  B^^  a/t  dd^/dO^  exp(j*^^) 

where  B|-t-  is  given  by  equation  (68) »  and  where 

tan  B  “  z30_/3z  tan  B  “  n30_/3n  tan  B  ^  ■  a30  /3a  (94) 

zz  z  tin  n  ad  a 


zt 

0 

z 

+ 

^zz 

-  ®t  - 

^t 

(95) 

1  s 

nt 

0 

n 

+ 

^nn 

-6t- 

^t 

(96) 

>  as 

at 

0 

a 

+ 

^aa 

-®t- 

^t 

(97) 

The  derivatives  dz/dt  ,  dn/dt  and  da/dt  are  evaluated  from  equation  (46)  for 
actinide  nuclei  as 

dz/dt  =  dZ/dt  sec  0  +  Z  sec  0  tan  0  d6  /dt  (98) 

z  z  z  z 

=  sec  0  (dZ/dt(l  +  tan  0  Z30  /3Z)  +  Z  tan  0  30  /3H  dH/dt] 

z  z  z  z  z 


dn/dt  =  dN/dt  sec  0  +  N  sec  0  tan  0  d0  /dt  (99) 

n  n  n  n 

=  sec  0  [dN/dt(l  +  tan  0  N30  /3N)  +  N  tan  0  30  /3H  dH/dt] 

n  n  n  n  n 


da/dt  =  dA/dt  sec  0  +  A  sec  0  tan  0  d0  /dt  (100) 

a  a  a  a 

=  sec  0  [dA/dt(l  +  tan  0  A30  /3A)  +  A  tan  0  30  /3H  dH/dt] 

a  a  a  a  a 

It  is  assumed  that  302/9t  =  0  ,  30n/3t  «=  0  and  dd^/dt  ■  0  .  The  derivatives 
in  equations  (98)  through  (100)  are  used  in  equations  (91)  through  (93). 
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Combining  equations  (46),  (70)  through  (72),  (91)  through  (93),  and 
(98)  through  (100)  gives  for  actinide  nuclei 


dz/dt  *  J'  cos  sec  8  sec  0  exp(j*  ^) 
z  tt  zz  z  zt 


(101) 


*  I  cos  8^^  CSC  8  sec  6  exp(j4  ^) 

z  tt  zz  z  zt 

*  (I^  +  cos  8^^  sec  e  exp(14  .) 

z  z  tt  z  zt' 


dn/dt  =  J'  cos  8  ^  sec  6  sec  8  exp(1*  ) 
n  tt  nn  n  nt 


(102) 


=  I 

cos 

CSC 

6 

sec 

0 

exp(j* 

n 

tt 

nn 

n 

^  nt 

+ 

1/2 

cos 

^t 

sec 

6  exp(j*  .) 

=  J' 
a 

cos 

^t 

sec 

®aa 

sec 

6 

a 

exp(j*at) 

=  I 

a 

cos 

^t 

CSC 

^aa 

sec 

0 

a 

exp(j'I>^j.) 

+ 

1/2 

cos 

^t 

sec 

®a 

(103) 


where  ,  *nt  and  ♦at  given  by  equations  (95)  through  (97)  and  where  for 
actinide  nuclei 


tan  8  =  I  /J' 

zz  z  z 


tan  8  =  I  /J' 

nn  n  n 


tan  8  =  I  /J' 

aa  a  a 


(104) 


tan  0^2  dZ/dt  +  Z/H  tan  dH/dt 

tan  a|L,  dN/dt  +  N/H  tan  aj?  dH/dt 
NN  nn 

tan  dA/dt  +  A/H  tan  a„„  dH/dt 
AA  nH 


(105) 

(106) 
(107) 


H  Z 

J'  =  (1  +  tan  0  tan  a„_)dZ/dt  +  Z/H  tan  9^  tan  a  dH/dt 
z  z  ZZ  z  Hn 

J'  =  (1  +  tan  e  tan  a^,)dN/dt  +  N/H  tan  0  tan  a^„  dH/dt 
n  n  NN  n  Hn 

H  A 

J'  =  (1  +  tan  0  tan  a,,)dA/dt  +  A/H  tan  0  tan  dH/dt 
a  a  AA  a  mh 


(108) 

(109) 

(110) 


and  where 
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tan  -  N3e  /3N  tan  a”  -  A3e  /3A  (111) 

NM  d 

tan  a”  -  H3e„/3H  tan  -  H3e  /3H  (112) 

an  u  nn  8 

Equations  (104)  through  (110)  are  valid  for  actinide  nuclei  in  an  external 
electromagnetic  field. 

Two  special  cases  are  of  Interest  for  application  to  nuclear  reactions 
and  transmutations  of  the  actinides. 

Case  a.  External  Radioactive  Decay  in  a  Constant  Electromagnetic  Field. 
For  this  case  it  follows  from  equations  (105)  through  (110)  that  for  the 


actinides 

I**  -  tan  o?_  dZ/dt  (113) 

z  ZZ 

I**  -  tan  a”  dN/dt  (114) 

n  NN 

l”  =  tan  a”  dA/dt  (115) 

8  rirk 

J^’  »  (1  +  tan  0  tan  a”  )dZ/dt  (116) 

Z  Z 

j”'  =  (1  +  tan  0^  tan  a^)dN/dt  (117) 

j”'  =  (1  +  tan  0  tan  a^JjdA/dt  (118) 

8  3  AA 


tan 

6*  - 
ZZ 

z  z 

=  tan 

H 

®ZZ 

(1 

+ 

tan 

0 

z 

tan 

H  .-1 
o»..) 

ZZ' 

(119) 

tan 

b”  - 

nn 

n  n 

=  tan 

H 

“nn 

(1 

+ 

tan 

0 

n 

tan 

H  .-1 

“nn^ 

(120) 

tan 

B«  = 
aa 

!«//' 
a  a 

=  tan 

H 

“aa 

(1 

+ 

tan 

0 

a 

tan 

H  .-1 

“aa> 

(121) 

“L  “  “hh  “  “hh  “ 

From  equations  (85)  through  (87)  and  (101)  through  (122)  it  follows  for 
constant  H  that  for  the  actinides 

(di/dt)„  =  cos  B  sec  6**  sec  0  exp(j***  )  (123) 

tl  Z  LU  ZZ  Z  Zu 

=  I  cos  B^^  CSC  B  sec  0  exp(i*  ) 
z  tt  ZZ  z  zt 

r/T^i^2  ^  ,^H\2,l/2  o  . 

=  1  ^tt  ®z  ®*P^j*zt^ 


tan  a?_  -  Z30  /3Z 
Zm  z 

tan  oH„  -  H30  /3H 

an  Z 
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(dn/dt)jj  -  ^  cos  sec  6®^  sec  6^  sec(j**^)  (124) 

-  ij  CO.  6„  CSC  B*„  ..c  «xp(J«*j) 

“  cos  sec  exp(j**^) 

(da/dt)jj  -  cos  6^^  sec  sec  6^  exp(J*^^)  (125) 

*  cos  6^^  CSC  6*^  sec  6  exp(j*^  ) 
a  tt  aa  a  at 

“  [(Ig)  +  (J^  )  ]  cos  Bj.j.  sec  6^  exp(j4^^) 

vhere 


zt 

6 

z 

+ 

zz 

-  ^  - 

^t 

(126) 

II 

u 

X  c 

0 

n 

+ 

8« 

nn 

-  ®t  - 

^t 

(127) 

at 

0 

a 

+ 

aa 

-  ®t  - 

^t 

(128) 

H  H  H 

where  S^z  .  6^^  and  6^^  given  by  equations  (119)  through  (121).  It  is  easy 
to  see  that  equations  (85)  through  (87)  are  equivalent  to  equations  (123) 
through  (125). 

Case  b.  Internal  Phase  Radioactive  Decay  of  the  Actinides 
due  to  a  Time  Dependent  Electromagnetic  Field. 

The  case  at  hand  corresponds  to  constant  values  of  Z  ,  N  and  A.  Equa¬ 
tions  (104)  through  (112)  give  for  the  actinides 


tan 

cot 

0 

B^ 

=  k/2 

-  0 

(129) 

zz 

Z 

zz 

Z 

tan 

b” 

— 

cot 

e 

b" 

=  ir/2 

-  e 

(130) 

nn 

n 

nn 

n 

tan 

B^ 

s 

cot 

0 

8^ 

=  Tr/2 

-  0 

(131) 

aa 

a 

aa 

a 

CSC 

b" 

sec 

0 

oN 

CSC  B 

=  sec 

0 

CSC 

oA 

B  =  sec 

0 

(132) 

zz 

Z 

nn 

n 

aa 

a 

sec 

b" 

CSC 

0 

oN 

sec  B 

=  CSC 

0 

sec 

B  =  CSC 

0 

(133) 

ZZ 

z 

nn 

n 

aa 

a 

H 

ZZ 

=  Tr/2 

H 

“nn  = 

/2 

H 

“aa 

=  Tl/Z 

(134) 
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(135) 


if  -  Z/H  tan  dH/dt 
Z  nU 

l”  -  N/H  tan  aJJjj  dH/dt  (136) 

-  A/H  tan  dH/dt  (137) 

z'  z 

■  Z/H  tan  0^  tan  dH/dt  (138) 

jJJ  -  N/H  tan  0  tan  a|!„  dH/dt  (139) 

n  n  HH 

-  A/H  tan  0  tan  dH/dt  (140) 

3  d  HH 


Combining  equations  (135)  through  (140)  and  using  equation  (112)  gives  the 
following  results  for  constant  Z  ,  N  and  A 

[(if)^  +  (jf’)^l^^^  -  Z/H  tan  oJjj  sec  0^  dH/dt  (141) 

-  Z  30  /3H  sec  0  dH/dt 


(dJJ)^  +  (/')^]^^^  =  N/H  tan  aJI„  sec  0^  dH/dt  (142) 

n  n  nn  n 

-  N  30  /3H  sec  0  dH/dt 
n  n 


[(I^)^  +  (J^')^]^^^  =  A/H  tan  sec  0  dH/dt  (143) 

a  a  HH  a 

=  A  30  /3H  sec  6  dH/dt 

3  3 

Combining  equations  (101)  through  (103)  with  equations  (141)  through  (143) 
gives  for  the  actinides  with  constant  Z  ,  N  and  A 

(di/dt)_  =  Z  cos  sec^0  30  /3H  dH/dt  exp(j$^^)  (144) 

Z  tt  z  z  zt 

(dn/dt)jj  =  N  cos  6^^  sec^0^  30^/3H  dH/dt  exp(j4>||^)  (145) 

(da/dt).  =  A  cos  6^^  sec^0  36  /3H  dH/dt  exp(j4'^^)  (146) 

A  tt  a  a  at 


where  equations  (95)  through  (97)  and  equations  (129)  through  (131)  give 


0  +  B^ 
z  zz 


(147) 
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♦*  -  e  +6**  - 

nt  n  nn  t  tt 


■  -  »tt 


>  -8  +3  -6  -8 
at  a  '^aa  t  ‘^tt 


./2  -  6^  -  3,, 


(148) 


(149) 


Therefore  for  the  Internal  phase  nuclear  reactions  of  the  actinides 


A 

zt  nt  at 


»/2  -  -  6^^ 


(150) 


which  is  a  result  that  is  different  from  the  analogous  case  for  the  subactinide 
elements. Equations  (144)  through  (146)  can  also  be  written  as 


Oz/3H),  •»  1Z  sec^e  36  /3H  (151) 

Z  2  Z 

(3n/3H)„  =  jN  sec^e  30  /3H  (152) 

N  n  n 

(3a/3H)^  =  jA  sec^e,  3e^/3H  (153) 

A  A3 

which  actually  corresponds  to 

(dz)„  -  jZ  sec^e  de  (154) 

L  Z  Z 

(dH)„  =  iN  sec^e^  de„  (155) 

Cl  n  n 

(di).  *  jA  sec^0  d6  (156) 

A  3  3 


which  is  easily  derived  from  equations  (43)  through  (45)  and  equations  (82) 
through  (84)  for  the  internal  phase  rotations  of  the  actinides. 

3.  RADIOACTIVE  DECAY  OF  THE  ACTINIDE  NUCLEI  IN  AN  EXTERNAL  FIELD.  This 
section  investigates  the  radioactive  decay  of  a  collection  of  actinide  nuclei 
in  the  presence  of  an  external  electromagnetic  or  gravitational  field.  Internal 
phase  angles  are  Induced  into  the  number  of  nuclei,  into  the  total  number  of 
constituent  nucleons  and  into  the  atomic  mass  number  of  each  nucleus.  An  ad¬ 
dition  law  for  actinide  nuclei  is  developed  that  relates  the  complex  total  nu¬ 
cleon  number,  the  complex  number  of  atomic  nuclei  and  the  complex  atomic  mass 
number . 


A.  Addition  Law  for  Actinide  Nuclei  with  Complex  Total  Nucleon  Number, 
Complex  Number  of  Atomic  Nuclei  and  Complex  Atomic  Mass  Number. 

In  the  presence  of  an  external  field  such  as  gravity  or  an  electromagnetic 
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field  the  particle  number  of  any  system  of  particles  must  be  represented  by 
complex  numbers  in  an  internal  space. Therefore  for  nuclei  in  an  electromag¬ 
netic  field  the  complex  number  of  total  nucleons  (protons  and  neutrond  within 


the  nuclei) ,  the  complex  number  of  atomic  nuclei  and 
number  are  written  as^® 

the  complex  atomic  mass 

(157) 

ii  -  N  exp(j0jj) 

(158) 

i  -  a  exp(je  ) 

4 

(159) 

where  and  6}],^  =  complex  number  value,  magnitude  and  internal  phase_angle 

of  the  total  number  of  nucleons  situated  within  all  of  the  atomic  nuclei;  N  ,  N 
and  6{i  ~  complex  number  value,  magnitude  and  internal  phase  angle  of  the  number 
of  atomic  nuclei;  and  as  before  a  ,  a  and  63  *  complex  number  value,  magnitude 
and  internal  phase  angle  of  the  atomic  mass  number  of  each  nucleus.  For  actin¬ 
ide  nuclei  it  follows  that  in  analogy  to  equations  (43)  through  (46)  the  parti¬ 
cle  number  magnitudes  that  appear  in  equations  (157)  through  (159)  are  «rritten  as 


n  sec  e„ 
n  Nn 

(160) 

n  sec  Ojj 

(161) 

A  sec  6 

(162) 

where  =  integer  number  of  total  number  of  nucleons  within  the  atomic  nuclei, 
n  *  Integer  number  of  atomic  nuclei,  and  as  before  A  =  atomic  mass  number  which 
is  an  integer.  The  Integer  numbers  satisfy  the  following  fundamental  universal¬ 
ly  valid  law  of  baryon  number  conservation 


=  nA 

tl 


(163) 


The  measured  nucleon  number,  nuclei  number  and  atomic  mass  number  are  given  by 
the  real  parts  of  equations  (157)  through  (159).  Using  equations  (160)  through 
(162)  gives  the  result 


N  =  n  ^ 
mn  n 


(164) 


which  are  integers  for  actinide  nuclei.  For  actinide  nuclei  the  measured  nu¬ 
cleon  number,  nuclei  number  and  atomic  mass  number  are  denumerable.  This  is 
not  the  case  for  subactinide  nuclei.^® 

Following  the  example  of  the  addition  law  given  by  equation  (54)  vdiich  is 
valid  for  a  single  nucleus,  the  addition  law  for  the  complex  particle  numbers 
is  written  as^® 

W  +  N  =  na  (165) 

n 

which  is  subject  to  the  universal  validity  of  equation  (163).  The  component 
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«|iMCloiis  of  oqiMtioo  (165)  for  00110140  noelol  oro  wrlttoo  wltk  tiio  iMlp  of 
oquotlon  (166)  oo 

w*  -  nA  (146) 

tan  0^^^  ■  nA  tan  0^  (167) 

which  give  the  relations 


W 


0„  -  0 

Nn  a 


(168) 


which  are  valid  for  actinide  nuclei.  Therefore  for  actinide  nuclei 


N 


n 


na 


N 


n 


na 


Nn 


(169) 


and  equation  (169)  is  equivalent  to  the  Integer  relationship  given  by  equation 
(163).  This  situation  is  much  simpler  than  the  case  for  subactlnlde  nuclei 
where  W'  0 


B.  Radioactive  Decay  of  Actinide  Nuclei  in  the  Presence  of 
an  Electromagnetic  Field. 

This  section  considers  the  radioactive  decay  of  the  actinides  in  a  time 
varying  external  field  such  as  electromagnetism  or  gravitation.  The  complex 
number  generalization  of  the  standard  radioactive  decay  law  for  the  heavy  el¬ 
ements  is  written  as^® 

dN/dt  =  -  XN  (170) 

where  N  *  complex  number  of  atomic  nuclei  and  X  »  complex  number  radioactive 
decay  constant  which  can  be  written  as^* 

X  *  X  exp(j0^)  (171) 

From  equations  (158)  and  (161)  it  follows  that  N  can  be  written  for  actinide 


elements  as 

N  =  N  exp(jejj)  N  »  n  sec  (172) 

=  n  sec  Ojj  exp(jejj) 

The  time  derivative  of  equation  (172)  is  given  by^® 

dN/dt  =  cos  3^^  sec  3jjjj  dN/dt  exp(j4>jjj.)  (173) 

»  cos  3^^  CSC  3jjjj  N  dOjj/dt  exp(j»jj^)  (174) 

=  sin  3^j.  sec  3jjjj  t  ^  dN/de^  exp(j$jj^)  (175) 

=  sin  3^^  CSC  3jqjj  N/t  de^^/de^  exp(j<^jj^)  (176) 
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vhere 


tan  B|g|  •  Id6||/3M 
♦«t  “  ®IOl  "  ®t  ’ 


(177) 

(178) 


Then  the  law  of  radioactive  decay  of  elements  given  in  equaticm  (170)  can  be 
written  in  any  of  the  following  forms^* 


cos  sec  Bjjjj  dM/dt  -  -  XN 

cos  CSC  dB^/dt  -  -  X 

tt  Nn  n 

sin  B^^  sec  Bjjjj  t  ^  dN/dB^  «  -  XK 

sin  B^j  CSC  Bju  t”^  dBij/dfl^  -  -  X 


(179) 

(180) 
(181) 
(182) 


cosd>lned  with  the  following  phase  angle  relationship 
♦«t  "  ®X  ^  ®II 


(183) 


Combining  equations  (178)  and  (183)  gives^ 


8|®  -  *t  - 


(184) 


The  derivative  dB{(/dt  that  appears  in  equation  (174)  is  written  as 
dBjj/dt  =  3Bjj/3n  dn/dt  +  3Bjj/3H  dH/dt 


(185) 


The  derivative  dN/dt  that  appears  in  equation  (173)  is  obtained  from  equation 
(172)  to  be  for  actinide  nuclei 


dN/dt  =  dn/dt  sec  Bjj  +  n  sec  Bjj  tan  dBj^/dt 

“  sec  Bjj  [dn/dt(l  +  tan  Bj^  n36jq/3n)  +  n  tan  Bj^  3Bjg/3H  dH/dt] 


(186) 


Equations  (170),  (171)  and  (173)  through  (185)  are  valid  for  the  radioactive 
decay  of  all  nuclei  but  equations  (172)  and  (186)  refer  only  to  actinide  nuclei. 

Another  form  of  the  law  of  radioactive  decay  that  applies  only  to  the 
actinides  can  be  obtained  in  analogy  to  equations  (101)  through  (103)  by  noting 
that  the  time  derivative  that  appears  in  the  radioactivity  lav  in  equation  (170) 
can  be  obtained  from  equations  (173),  (174),  (185)  and  (186)  as 


dS/dt  «  cos  B^^  sec  Bjjjj  sec  Bjj  exp(j*jj^) 

«  cos  B^^  CSC  Bjjjj  sec  Bj^  Ijj  exp(j*jj^) 

«  cos  B^j^  sec  Bjj  (I^  +  J^2)l/2  exp(j*jj^) 


(187) 

(188) 
(189) 
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«lMr« 


Ijl  •  tan  dn/dt  ♦  n/H  tan  a^g  ®/<*t 

Jg  ■  (1  +  tan  Og  tan  o®^)dn/dt  +  n/H  tan  Gg  tan  dH/dt 

tan  gjjjj  -  N30g/3N  «  Ig/J^ 
tan  -  n3ejj/3n 

tan  ajJjj  -  H3ejj/3H 

Combining  equations  (170) «  (172)  and  (189)  gives  for  the  actinides 
cos  sec  eg  (1^  +  exp(j*gp 

»  -  Xn  sec  Sg  exp[j(0j^  +  Sg)] 

which  gives  the  radioactive  decay  law  as 
cos  8^^  (Ig  Jg^)^^^  =  -  Xn 


(190) 

(191) 

(192) 

(193) 

(194) 

(195) 


(196) 


along  with  the  phase  angle  relationship  given  by  equations  (183)  and  (184). 
Equation  (196)  is  equivalent  to  equations  (179)  and  (180).  Equation  (196)  for 
the  radioactive  decay  of  the  actinides  is  similar  to  equation  (255)  of  Refer¬ 
ence  26  which  describes  the  radioactive  decay  of  subactinide  nuclei.  The  dif¬ 
ference  is  the  +  signs  that  occur  in  the  expression  for  Jg  in  equation  (191) 
for  actinide  nuclei,  and  the  corresponding  negative  signs  that  occur  in  the  ex¬ 
pression  for  Jg  that  describes  the  radioactive  decay  of  subactinide  nuclei  as 
in  equations  (250)  and  (255)  of  Reference  26.  The  phase  angle  Bg  may  be  neg¬ 
ative  so  that  in  this  case 

tan  ®g  “  ~  tanlSgl  (197) 


The  angle  0g  can  be  positive  or  negative. 

The  case  of  Incoherent  radioactive  decay  of  the  actinides  can  be  regained 
by  taking  Bg  =  constant,  0^  =  0  and  Btt  =  0  in  which  case  equations  (170),  (173) 
and  (186)  become 

sec  Bg  dn/dt  exp(j0g)  =  -  Xn  sec  0g  exp[j(0g  +  0^^)]  (198) 

3  7  3  8 

which  gives  the  conventional  law  of  radioactive  decay  of  elements  as  ' 

dn/dt  =  -  Xn  ®X  ~  ^  (199) 
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Equation  (209)  gives  the  internal  phase  angle  of  th<i  radioactive  decay  constant 
for  atomic  nuclei  in  a  constant  magnetic  field. 

C.  Coherent  Radioactive  Decay  of  the  Actinides. 

This  ‘'cction  considers  the  case  where  n  >  constant  which  corresponds  to 
radioactive  decays  where  the  integer  number  of  actinide  nuclei  remains  constant 
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and  only  the  Internal  phase  angle  Sjj  changes  due  to  the  presence  of  a  time  de¬ 
pendent  external  electromagnetic  field.  For  n  =  constant,  equations  (190), 
(191)  and  (194)  give  for  actinide  nuclei 

iJJ  n/H  tan  ojjjj  dH/dt  (210) 

-  n  dGjj/dt 

=  n/H  tan  tan  dH/dt  (211) 

*  n  tan  6„  d0.  /dt 
N  n 

and  equation  (192)  gives 

tan  bJjj  =  cot  Ojj  aJJjj  =  it/ 2  -  6^^  (212) 

Equations  (196),  (210),  (211)  and  equations  (184)  and  (212)  give  the  internal 
phase  radioactive  decay  equations  for  actinide  nuclei  as  follows 


cos  sec  e„  d9„/dt  =  -  X 
tt  N  N 


^  -  ^t 

=  it/2  -  -  6^^ 


(213) 

(214) 


where  6^.^.  is  given  by  equation  (68)  .  The  internal  phase  radioactive  decay 
equation  (213)  can  also  be  obtained  from  equations  (180)  and  (212).  Equations 
(213)  and  (214)  can  also  be  obtained  directly  from  equations  (170)  and  (172) 
or  from  equations  (189)  through  (191)  by  realizing  that  for  actinide  nuclei 
with  n  =  constant 

(dii/dt)^  =  n  cos  sec^Sjj  d0jj/dt  exp[j(Tr/2  “  6^.  ~  (215) 

=  N  cos  sec  0jj  d0jj/dt  exp[j(ir/2  "  ~  ^tt^^ 

and 

(dN)^  =  n  sec^9jj  d0j^  exp(jir/2)  (216) 

2 

=  in  sec  0.,  d0„ 

=  jN  sec  0JJ  dSjj  exp(-jejj) 

=  N  sec  d0j^  exp[j(TT/2  -  0^^)  ] 


and  where 

l(dN)^|  =  n  sec^0j^  dOj^ 


(217) 
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Equation  (216)  gives  the  coherent  (n  =  constant)  change  of  the  complex  number 
of  actinide  nuclei  given  by  equation  (172).  Equations  (213)  and  (21A)  are 
coupled  simultaneous  differential  equations  that  determine  0jj  and  Of  for  ac¬ 
tinide  nuclei  that  are  undergoing  internal  phase  radioactive  decay.  For  slow¬ 
ly  changing  values  of  0t  equation  (214)  is  written  as 

0^  =  tt/2  -  “  ®t  ■  t80^/at  (218) 

which  is  an  approximate  differential  equation  for  0j|j  and  6(-  . 

It  has  been  shown  that  a  solution  to  equation  (213)  is  given  by^^ 

(sec  0JJ  +  tan  0jj)/(sec  0°  +  tan  0^)  =  exp[-  Ag(t)]  (219) 

where 

t 

g(t)  =  J  sec  6  dt  (220) 

o 

where  0jj  =  0^  for  t  =  0  ,  and  for  t  =  “  it  follows  that^® 

0^  =  -  Tr/2  (221) 

From  equation  (214)  it  follows  that  for  t  ->•  “  and  for  actinide  nuclei 

e,  =  .  -  -  6^^  (222) 

'v  n  -  6^  -  t30j./3t 

which  is  a  differential  equation  for  0^.  .  Therefore  for  the  internal  phase 
radioactive  decay  of  actinide  nuclei  in  the  limit  of  t  »  equation  (222)  gives 

0  =  TT  -  0  +  c/t  (223) 

L  A 

where  c  =  constant.  For  t  =  “  equation  (223)  gives 

0*  =  tt  -  0  (224) 

L  A 

for  internal  phase  decays  of  actinide  nuclei. 

4.  NUCLEAR  MASS  FORMULA  FOR  ACTINIDE  NUCLEI  IN  AN  ELECTROMAGNETIC  OR 
GRAVITATIONAL  FIELD.  This  section  develops  a  broken  symmetry  form  of  the  liquid 
drop  type  or  nuclear  mass  formula  that  describes  the  nuclear  binding  energy  of 
actinide  nuclei.  The  nuclear  mass  formula  can  be  used  to  study  the  suppression 
of  thermal  neutron  induced  binary  fission  and  the  enhancement  of  quaternary 
fission  in  the  actinides  by  the  presence  of  a  y  ray  field  as  described  in  Sec¬ 
tions  5  and  6. 

A.  Complex  Number  Radius  for  Actinide  Nuclei. 


600 


The  simplest  scalar  expression  for  the  nuclear  radius  is  given 
1/3 

R  “  bA  b  =  1.2  fm  (225) 


b  =  1.2  fm 


In  an  external  electromagnetic  or  gravitational  field  the  nuclear  radius  is 
represented  by  a  complex  number  as  follows^® 

-  -1/3 

R  =  R  exp(j0j^)  =  ba  (226) 

where  the  complex  number  constant  b  is  written  as^® 


b  =  b  exp(j0^) 


(227) 


and  a  =  complex  atomic  mass  number  which  for  the  actinides  is  given  by  equation 
(45) .  Combining  equations  (45)  and  (226)  gives  for  the  actinides 


1/3  1/3 

R  «  bA  '  sec^'^0^  exp[j(6^  +  6^/3)] 


®R  -  «b  ^ 


(228) 


(229) 


The  measured  radius  of  an  actinide  nucleus  is  given  by  the  real  part  of  equa¬ 
tion  (228) 


R  =  R  cos  0_  =  bA^'^^  sec^'^^0  cos(0^  +  0  /3) 
m  R  aba 

which  can  be  rewritten  as 

R  .b'A‘/3 
m 

where  for  the  actinides 


(230) 


(231) 


b'  =  b  sec  0  cos(0,  +0/3) 

d  u  d 


(232) 


The  effective  radius  constant  b*  is  a  slowly  increasing  function  of  the  applied 
external  field.  Therefore  in  a  weak  external  field  b*  %  b  =  1.2  fm  ,  while  in 
a  strong  field  b'  >  b  . 

B.  Binding  Energy  of  Actinide  Nuclei  Located  in  an 
Electromagnetic  or  Gravitational  Field. 

The  standard  scalar  expression  for  the  binding  energy  B  of  a  nucleus 
(Z,A)  is  given  by  the  liquid  drop  model  as^®~^® 


B  =  E  -  E  -  E  -  E  +E  .  +E,,, 

V  s  c  sym  pair  shell 

=  oA  -  yA^^^  -  6Z^/A^^^  -  6(N  -  Z)^/A  +  Pp/A^^^  +  E 


(233) 


shell 


where  E^  ,  Eg  ,  E^  ,  Egy^  ,  Epgir  ^shell  “  volume,  surface,  Coulomb,  sym¬ 
metry,  nuclear  pairing  and  nuclear  shell  energies  respectively,  and  where  a  , 


601 


7  ,  6  and  6  =  volume,  surface.  Coulomb  and  symmetry  energy  coefficients  respec¬ 
tively,  and  where  from  equation  (55)  it  follows  that  N  -  Z  =  A  -  2Z  .  The  pair¬ 
ing  energy  is  given 


E  . 
pair 


=  Pp/A 


3/4 


(234) 


where  p  =  pairing  energy  coefficient  34  MeV  and  where  P  is  given  by  equation 
(25) .  The  average  binding  energy  per  nucleon  e  =  B/A  is  written  as 


e=e  -e  -e  -e  +e  .  +e.,, 

V  s  c  sym  pair  shell 


(235) 


a  -  -  6Z^/A^^^  -  6[(N  -  Z)/A]^  +  Pp/A^^^  +  e 


shell 


where  Cy  ,  Cg  ,  e^,  ,  ^sym  »  ^pair  ^shell  “  average  volume  energy  per  nu¬ 
cleon,  average  surface  energy  per  nucleon,  average  Coulomb  energy  per  nucleon, 
average  symmetry  energy  per  nucleon,  average  pairing  energy  per  nucleon  and  the 
average  shell  energy  per  nucleon  respectively.  More  complicated  forms  of  the 
nuclear  symmetry  energy  have  been  considered  by  including  the  effects  of  the 
nuclear  bulk  modulus.  However,  in  this  paper  only  the  simple  WeizsScker- 
Bethe  form  given  in  equation  (235)  is  considered. 


For  an  actinide  nucleus  in  the  presence  of  an  electromagnetic  or  gravi¬ 
tational  field  the  complex  number  nuclear  binding  energy  is  written  as^ 


B  =  E  -  E  -  E  -  E  +E  .  +E,,, 

V  s  c  S3mi  pair  shell 

—  —2/3  I-2/-1/3  s,.  -.2,-  ^  „-/-3/4  ^  - 

=  aa  -  ya  -  6z  /a  -  6(n  -  z)  /a  +  Pp/a  +  E  , 

shell 


(236) 


where  z  ,  n  and  a  are  given  by  equations  (43)  through  (45)  respectively,  Ey  ,  Eg  , 
Ec  »  Esym  ,  Epaj[j.  and  Eg^g].!  complex  number  volume,  surface.  Coulomb,  symmetry, 
pairing  and  shell  energies  respectively,  and  where  a  ,  y  »  6  and  B  =  complex 
number  volume  surface.  Coulomb  and  symmetry  energy  coefficients  respectively. 

The  mass  formula  coefficients  are  represented  as 


1  PI 

II 

p 

exp(je^) 

Y  =  y 

exp(je^) 

(237) 

6  =  6 

exp(j0^) 

B  =  6 

exp(j6g) 

(238) 

E  . 
pair 

=  E  .  exp(j0_  ,  ) 
pair  Epair 

^shell 

=  E  ,  . ,  exp(i6_  ,  , , ) 

shell  Eshell 

(239) 

where  the  complex  number  pairing  energy  is  written  as 

1  .  =  Pp/i^'^^  (240) 

pair 

where  the  complex  number  pairing  energy  coefficient  is  written  as 

p  =  p  exp(j0p)  (241) 
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so  that 


E  ,  -  Pp/a' 

pair 


0r.  ^  “  0  -  3/^0 

Epair  p  a 


(2A2) 


The  average  complex  number  binding  energy  per  nucleon  t  >  B/a  is  written  as 


e*e  -  e  -  e  -  e  +e  . 

V  s  c  sym  pair  shell 

=  a  -  y/a^^^  -  6z^/a^^^  -  6{(n  -  z)/a]^  +  Pp/a^^^  +  e 


(243) 


shell 


where  Cy  ,  Cg  ,  ,  ^sym  >  ^pair  ^shell  ^  complex  number  average  volume, 

surface.  Coulomb,  symmetry,  pairing  and  shell  energies  per  nucleon  respectively. 

The  complex  number  neutron  excess  that  appears  in  the  symmetry  energy 
terms  in  equations  (236)  and  (243)  is  written  for  actinide  nuclei  exactly  as 


1  =  5  exp(je^)  =  n-  z  =  i-2z 


(244) 


where  n  ,  z  and  a  are  given  by  equations  (43)  through  (45)  and  are  related  by 
equation  (62) .  Then  for  actinide  nuclei 


5^  =  n^  +  z^  -  2zn  cos(0  -  0  ) 

n  z 

2  2  2  2 

=  N  sec  0  +  Z  sec  0  -  2ZN  sec  0  sec  6  cos(6  -  0  ) 

n  z  z  n  n  z 


(245) 


tan  0_  =  (n  sin  0  -  z  sin  0  )/(n  cos  0  -  z  cos  0  ) 

5  n  z'  '  n  z' 


(246) 


=  (N  tan  0  -  Z  tan  0  )/(N  -  Z) 

n  z 

For  the  approximation  0z  '''  0n  >  which  is  valid  near  the  valley  of  beta  sta¬ 
bility,  it  follows  from  equations  (245)  and  (246)  that 


5  'V  n  -  z  =  N  sec  0  -  Z  sec  0  (N  -  Z)sec  0 

n  z  z 

0^  'V.  0  %  0 

5  z  n 


(247) 

(248) 


Combining  equations  (243)  and  (244)  gives  the  exact  equation 
e  =  a  -  -  8(5/a)^  +  Pp/i^^^  + 


shell 


(249) 


It  follows  from  equation  (244)  that  the  following  exact  equation  is  valid  for 
actinide  nuclei 
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C/i  -  5/a  exp[j(0^  -  0^)] 
*  1  -  2i/a 


(250) 


1  -  2(Z  sec  0,)/(A  sec  0  )  exp(j(0  -  0  )] 

Z  a  Z  o 


Combining  equations  (249)  and  (250)  gives  the  following  exact  expression  for 
the  average  energy  per  nucleon  for  actinide  nuclei 


£  =  5  -  -  6(1  -  2i/a)^  +  Pp/a^^^  +  e 


shell 


(251) 


which  is  a  simple  complex  number  generalization  of  the  standard  scalar  result. 

Equation  (236)  giving  the  complex  number  binding  energy  of  an  actinide 
nucleus  can  be  written  as 

2/3  2/3 

B  =  aA  sec  0^  exp[j (0  +  0  )]  -  yA  sec  '^0  exp[j(0  +  2/36  )]  (252) 

A  Ot  3  3  Y  3 

-  sec^0  sec"^^^6  exp[j(6^  +  26  -  1/30  )] 

2  3  O  Z  3 

-  65 sec“^0^  exp[j(6g  +  26^  -  6^)] 

+  PpA  sec  ^^^0  exp[j(0  -  3/40  )]  +  E  .  exp(j6  ,  ,.) 

a  p  a' ‘  shell  Eshell"^ 

As  an  approximation,  the  phase  angles  of  the  terms  in  equation  (252)  are  taken 
to  be  equal 


0„'v0  +  0  ^0  +  2/30 

B  a  a  Y  a 

0.  +  20  -  1/36  'v  0,  +  26^  -  0 

6  z  a  6  5  a 


(253) 


'^0  -3/40 

p  a  Eshell 

Then  the  magnitude  of  the  binding  energy  for  actinide  nuclei  is  obtained  from 
equation  (252)  to  be  approximately 


2/3  2/3  2  -1/3  2  -1/3 

B  aA  sec  6  -  yk  '  sec  '  6  -  6Z  A  '  sec  6  sec  '  6 

a  a  z  a 

-  B5^A"^  sec"^0  +  sec"^^^0  +  E 

a  a  shell 


(254) 


Equation  (59)  gives  0^  =  6a(02,0n>Z,A)  so  that  in  all  further  calculations  it 
should  be  understood  that  6g  is  not  really  an  independent  variable.  The  approx¬ 
imation  ©2  ©u  allows  equation  (254)  to  be  written  as 


2/3  2/3  2  -1/3  2  -1/3 

B  aA  sec  6  -  sec  ^6  -  6Z  A  sec  0  sec  '^0 

a  a  z  a 

-  6(N  -  Z)^a"^  sec^0  sec’^e  +  PpA"^^^  sec"^'^^©  +  E  ^ 

z  a  a  shell 


(255) 
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The  further  approximation  Sr  -v  0^  -v.  0^  'v.  0^  allows  equation  (253)  and  (254)  to 
be  written  for  actinide  nuclei  as 


+  0  0  +  2/30 

1 

>  o 

ay  a 

'''  0. 

+  5/30  -v*  0^  +  0 

6 

a  6  a 

-v.  0 

-  3/40  -v.  0  ,  ,, 

P 

a  Eshell 

oA  ; 

2/3 

sec  0  -  yA  sec 

(256) 


-  6(N  -  Z)V^  sec  e  +  sec'^^^e  +  E  ^ 

a  a  shell 


(257) 


which  are  valid  in  the  valley  of  beta  stability.  For  actinides  at  the  fission 
condition  03  'V'  202  equation  (64)  gives  the  approximations  6n  2.602 
0n  1.30a  • 

Each  term  in  the  expression  for  the  binding  energy  and  average  binding 
energy  per  nucleon  of  actinide  nuclei  will  now  be  considered  separately  with 
the  exception  of  the  nuclear  shell  effects  which  are  more  complicated  and  are 
not  considered  in  this  paper. 

a.  Volume  Energy  Term  for  Actinide  Nuclei. 

The  volume  energy  terms  are  written  for  actinide  nuclei  as 


E  ■  Si  «  oa  exp[i(0  +  6  )] 
v  '  a  a 

a  aA  sec  0  exptj(6  +  0  )1 
a  '  a  a 


(258) 


£  *5=0  exp(J0  )  (259) 

The  volume  energy  per  nucleon  5  describes  the  energy  per  nucleon  of  infinite 
nuclear  matter  but  evaluated  at  the  central  density  of  a  nucleus.^® 

b.  Surface  Energy  Term  for  Actinide  Nuclei. 

The  surface  energy  terms  are  written  as 

Eg  =  ya^^^  =  exp[j(9^  +  2/30^)]  (260) 

=  yA^^^  sec^^^e  exp(j(0  +  2/30  )] 

3  I  o 

e  =  =  ya  exp[j(0  -  1/30  )]  (261) 

s  Y  a 

=  yA”^^^  sec  ^^^0  exp[j(0  -  1/30  )] 

3  I  ® 
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The  complex  number  surface  energy  coefficient  can  be  written  in  terms  of  the 
central  density  of  an  atomic  nucleus. 

c.  Coulomb  Energy  Term  for  Actinide  Nuclei. 

The  complex  number  Coulomb  oiergy  terms  are  written  as 

E  -  exp[j(e.  +  26  -  1/36  )]  (262) 

c  o  z  a 

-  sec^6  8ec"^^^6  exp[j(6-  +  26  -  1/36  )] 

z  a  6  z  a 

-  6z^/i^^^  -  6zV^^^  explj(6^  +  26^  -  4/36^)]  (263) 

-  sec^e  sec^'^'^e  exptj(e,  +  26  -  4/36  )] 

Z  d  0  Z  A 

The  complex  number  Coulomb  energy  coefficient  can  be  written  as  a  generalization 
of  the  standard  scalar  result^ 

6  =■  3/5(e^/b)  =  0,863/b  MeV  (264) 

»  (0. 863/1. 523)k  MeV 
c 

where  b  ^  complex  number  radius  parameter  defined  in  equation  (226).  Equation 
(264)  is  equivalent  to 

6  =  0.863/b  =  (0. 863/1. 523)k  MeV  (265) 

c 

6-  =  -  8,  =  6,  (266) 

6  b  kc 

For  simplicity  it  will  be  assumed  that  the  wave  number  k^  of  the  central  density 
of  an  atomic  nucleus  is  approximately  equal  to  the  wave  number  of  infinite  nu¬ 
clear  matter  ky  ,  so  that 

k  'V  kp  =  1.35  fm"^  (267) 

c  F 

but  in  fact  k.  is  slightly  larger  or  smaller  than  kp  due  to  Coulomb  and  surface 
forces.'® 

d.  Symmetry  Energy  Term  for  Actinide  Nuclei. 

The  complex  number  symmetry  energy  terms  are  written  as 

Esym  =  =  85 exp[j(eg  +  26^  -  8^)]  (268) 

=  ^  sec  exp[j(8g  +  26^  -  6^)] 
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(269) 


Sy»  “  exp(j(0g  +  26^  -  20^)] 

■  6C^a“^  sec~^e  exp(j(0  +  20.  -  20  )] 
a  p  ^  a 

where  i  ,  C  and  0^  are  given  by  equations  (244)  through  (246)  respectively. 
Equations  (268)  and  (269)  can  be  simplified  by  assuming  the  approximation 


@2  0n  •  then  eqtiations  (245)  and  (246)  give 

C  (N  -  Z)sec  0^  ®z  ®n  (270) 

and  equations  (268)  and  (269)  become 

i  ^  6(N  -  Z)V^  sec"^0  sec\  exptj(6g  +  20,  -  0,)]  (271) 

Sjno  o  z  p  z  a 

e  -x.  B(N  -  Z)^a"^  sec“^0  sec^0  exptj(0-  +  20  -  20  )]  (272) 

sym  a  B  z  a 

As  a  further  approximation  let  0z  6n  '''  ^a  »  which  follows  from  equation  (59) 
for  6z  0n  >  then  equations  (271)  and  (272)  become 

Egym  6(N  -  Z)V^  sec  0^  exp[j(6g  +  0^)1  (273) 

-  Z)V^  exp(j0-)  (274) 

syin  p 


Equations  (273)  and  (274)  are  valid  in  the  vicinity  of  the  valley  of  beta  sta¬ 
bility  where  Qz  ©n  '''  0a  •  Th®  complex  number  symmetry  energy  coefficient 
can  be  written  in  terms  of  the  central  density  of  an  atomic  nucleus. 

e.  Pairing  Energy  Term  for  Actinide  Nuclei. 

The  complex  number  pairing  terms  are 

Epair  •  Pp/5^^^  -  Ppa"^^^  exp[j(ep  -  3/40^)]  (275) 

=  PpA  sec  exp[j(6p  -  3/49^)] 

^pair  ”  *  PP®  exp[j(ep  -  7/40^)]  (276) 

■  PpA  sec  ^^^0  exptj(9  -  7/49  )] 

a  pa 

The  shell  energy  term  is  more  complicated. 

c.  Measured  Binding  Energies  of  Actinide  Nuclei  Located  in  an 
External  Field. 
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The  real  and  Imaginary  parts  of  equation  (236)  are  given  by 

2/3  2/3 

B  cos  e  -  aA  sec  e  cos(0  +  6  )  -  yh  sec  '  6  cos(6  +  2/36  )  (277) 

o  d  01  S  3  Y  d 

2  -1/3  2  -1/3 

-  6Z  A  '  sec  6  sec  '  0  cos(0j,  +  20  -  1/36  ) 

z  a  6  z  a 

-  sec“^0  cos(6o  +20,-6  ) 

a  6  5  a 

+  PpA  sec  ^^^6  cos(0  -  3/40  )  +  E  ,  ,,  cos  6-  . 

a  '  p  a'  shell  Eshell 

2/3  2/3 

B  sin  6  =  aA  sec  6  sin(6  +  6  )  -  yk'  sec  '  b  sin(6  +  2/36  )  (278) 

X>  3  Ol  3  3  Y  ^ 

2  -1/3  2  -1/3 

-  6Z  A  '  sec  6  sec  '6  sin(6^  +  26  -  1/30  ) 

Z  3  0  Z  3 

-  sec“^0  sin(6-  +  26  -  6  ) 

a  B  4  3 

+  sec~^^‘*Q  sin(0  -  3/46  )  +  E  .  sin  0„  . 

a  '  p  '  shell  Eshell 


Equations  (277)  and  (278)  immediately  determine  B  and  Bg  ,  The  measured  bind¬ 
ing  energy  is  just  the  real  part  of  the  complex  number  binding  energy,  so  that 


B  =  a  A  -  Y  A^^^  -  6  -  B'C^a"^  +  E™  .  +  e“  . 

m  m  m  m  ro  pair  shell 


(279) 


where  for  actinide  nuclei 


a  =  a  sec  6  cos(0  +  6  ) 
m  a  a  a 


(280) 


y  -  y  sec  0  cos(0  +  2/36  ) 
m  a  Y  a 

2  -1/3 

6  “6  sec  0  sec  6  cos(0r  +20  -  1/36  ) 
m  z  a  o  z  a 


(281) 


(282) 


B'  =  B  sec  0  cos(0„  +  20_  -  6  ) 

ID  3  p  ^  3 

E™  ,  =  PpA  sec  ■'^^6  cos  (6  -  3/40  ) 

pair  a  pa 


(283) 


(284) 


where  ^  =  5(Z,N,02,6n)  ^nd  is  defined  by  equation  (245),  and  6^  =  6^(Z,N,02,0n) 
is  defined  by  equation  (246) .  The  internal  phase  angle  6a  is  given  by  equation 
(59)  to  be  0a  =  0a(Z,N,62,0n)  • 


If  the  small  angle  approximation  0^  'v  0^  “v  6^  is  assumed,  then  the  approx¬ 
imations  in  equation  (270)  and  (271)  allow  equation  (279)  to  be  written  as 


U  A  a2/3  r  -2  -1/3  .  v\2a~1  j.  U®  j.  p® 

B  “oA-yA  -oZA  -6(N-Z)A  +E  ,  +£,,, 

m  m  m  m  m  pair  shell 


(285) 
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where  now 


2  -1 

6  ■  S  sec  0  sec  0  cos(0-  +20  -  0  ) 

B  z  a  0  z  a 

If  the  further  small  angle  approximation  0^  ‘v  02  0^  is  made, 

(285)  is  the  measured  binding  energy  with 

a  -a  sec  0  cos (6  +  0  ) 

m  a  a  a 

Y  ■  Y  sec^^^0  cos(0  +  2/30  ) 

m  a  Y  a 

5/3 

6  “6  sec  0  cos(0,  +  5/30  ) 

m  a  6  a 

6  “  0  sec  0  cos(0„  +  0  ) 

m  a  0  a 


(286) 

then  equation 

(287) 

(288) 

(289) 

(290) 


which  are  useful  for  nuclei  near  the  valley  of  beta  stability.  The  measured 
values  of  the  symmetry  energy  coefficients  are^®»^® 

a  -  15.5  Y  =  17.2  6  =  0.698  0  =  23.3  MeV  (291) 

m  m  m  m 

Equations  (287)  through  (290)  show  that 

a<a  y  <  y  6  <  6  B<0  (292) 

m  m  ro  m 

The  values  of  the  nuclear  mass  formula  parameters  02  >  >  ^a  >  >  T  > 

9y  »  B  and  0g  can  be  obtained  by  fitting  the  real  part  of  the  complex 

number  binding  energy  given  by  equation  (279)  to  the  measured  values  of  the 
atomic  masses  of  the  actinide  elements.  A  simplified  procedure  uses  the  approx¬ 
imation  02  '''  0n  equation  (285)  for  the  fit  to  atomic  masses.  Expressions 
for  the  atomic  masses  of  the  elements  will  now  be  considered. 


D.  Masses  of  Actinide  Atoms  Located  in  an  Electromagnetic  or 
Gravitational  Field. 

The  conventional  relationship  between  atomic  mass  and  nuclear  binding 
energy  is  written  as^® 

M  =  Znij^  +  Nm^  -  B  (293) 

where  M  =  atomic  mass  of  an  element,  m^  =  mass  of  hydrogen  atom  and  m^  =  neu¬ 
tron  mass.  In  the  presence  of  an  external  field  the  atomic  mass  is  a  complex 
number  in  an  internal  space  and  is  given  by^® 

M  =  zm^j  +  nm^  -  B  (294) 

where  the  complex  number  atomic  mass  is  written  as 
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M  -  M  exp(je^) 


(295) 


Using  equations  (51)  and  (52)  allows  the  real  and  imaginary  parts  of  equation 
(29A)  to  be  written  as 


M  cos  *  G 

(296) 

M  sin  0„  «=  F 
n 

(297) 

where  for  actinide  nuclei 

G  =  nijjZ  +  m^N  -  B  cos  0g 

F  *  m^Z  tan  6  +  m  N  tan  6  -  B  sin  6_ 

^  2  n  n  B 

Equations  (296)  through  (299)  can  be  used  to  obtain  M  and  6m  as 

tan  0„  =  F/G  (300) 

M 

2  2  2 

M  =  F  +  G  (301) 

The  measured  atomic  mass  for  the  actinide  elements  is  given  by  equations  (296) 
and  (298)  which  can  be  rewritten  as 

M  =nL,Z+mN-B  (302) 

m  H  n  m 

where  Bj^  is  given  by  equations  (279)  or  (285).  The  small  angle  approximation 
9z  '''  ®n  9a  '''  combined  with  equations  (294)  through  (302)  gives  for  ac¬ 
tinide  nuclei 


(298) 

(299) 


0  -v  e  Q  %  0  -v  0„  (303) 

M  z  n  a  B  '  ' 

M  '''  (®jjZ  +  m^N)sec  0^  -  B  (304) 

where  63  is  given  by  equation  (63)  within  this  approximation.  Note  that  M  , 

Mm  and  0m  vary  with  the  strength  of  the  applied  electromagnetic  field  because 
0z  =  6z(H)  in  equations  (300)  through  (302),  but  the  following  intrinsic  mass 
is  a  constant  independent  of  the  applied  electromagnetic  (or  gravitational) 
field 


nij^Z  +  m^N  =  constant  (305) 

and  represents  the  universal  law  of  the  conservation  of  rest  mass  and  baryon 
number . 

The  variation  of  the  measured  atomic  mass  and  the  magnitude  of  the  atomic 
mass  with  the  strength  of  the  external  electromagnetic  field  can  be  obtained 
from  equations  (301)  and  (302)  by  the  following  formulas 


FIO 


dH  /dH  -  3H  /ae  de  /dH  +  3M  736  de  /dH  +  3M  736  dO  7dH  (306) 

n  azz  mnn  maa  ' 


dM7dH  -  3M73d  de  7dH  +  3M73e  de  7dH  3M73e  de  7dH 
z  z  n  n  a  a 


(307) 


The  internal  phase  angle  e^  -  6a(®z»®n»^»^^  given  by  equation  (59)  so  that 


the  derivative  deg7dH  can  be  evaluated  as 


de  7dH  -  3e  73e  de  7dH  +  ae  73e  de  7dH 
a  azz  a  n  n 


(308) 


Therefore  quatlons  (306)  and  (307)  can  be  written  as 

dM  7dH  =  (3M  73e  +  3M  736  36  736  )de  7dH 

m  m  z  m  a  a  z  z 

+  (3M  736  +  3M  736  36  736  )de  7dH 

m  n  m  a  a  n  n 


(309) 


dM7dH  -  (3M73e  +  3M73e  36  736  )de  7dH 

z  a  a  z  z 

+  (3M/38  +  aM/ae  ae  /de  )de  7dH 
n  a  a  n  n 

where  for  example  equation  (302)  gives  for  actinide  nuclei 


(310) 


3M  736  =  -  3B  736 

m  z  m  z 

3M  736  =  -  3B  730 

m  n  m  n 

3M  736  =  -  3B  736 

m  a  m  a 


(311) 

(312) 

(313) 


where  Bjj,  is  given  by  equation  (279). 

For  the  approximate  case  6^  6^^  “v  0^  ,  which  follows  frcas  eqxiation  (59) 

for  small  arguments,  it  follows  that 


dM  7dH  -  dM  7de  d0  7dH 
m  m  a  a 

dM/dH  =  dM7d0  d0  7dH 
a  a 


(314) 

(315) 


where  equations  (302)  and  (304)  give  for  6z  0n  '''  6a  and  for  actinide  nuclei 


dM  7d6  =  -  dB  7d6 

m  a  m  a 


dM/d0^  =  (m^Z  +  m^N)sec  0^  tan  0^  -  dB7d0^ 


(316) 

(317) 


where  the  approximate  value  of  B^  given  by  equations  (285)  and  (287)  through 
(290)  are  used  in  conjunction  with  equation  (316),  while  the  approximate  value 
of  B  given  by  equation  (257)  is  used  in  conjunction  with  equation  (317).  There¬ 
fore  from  equation  (285)  and  within  the  approximation  '''  0n  '''  6a  ,  the  deriv¬ 

ative  in  equation  (316)  is  given  by 
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dB  /de  -  da  /dO  A  -  dy  /dO  -  d6  /dO 

mania  ma  ma 

-  dB  /d6„  (N  -  Z)^A~^  +  dE®  ,  /d6  +  dE®.  .JdQ 

m  a  '  '  pair  a  shell  a 


(318) 


where  from  equations  (287)  through  (290)  for  Qz  '''  ®n  '''  ®a  follows  that  for 
actinide  nuclei 


do  /d0  «  -  a  sec  9  cos(9  +  0  )[tan(0  +  0  )  -  tan  0  ] 

m  a  a  a  a  a  a  a 


(319) 


dy  /d0  =  -  2/3y  sec  0  cos(0  +  2/30  )[tan(b  +  2/30  )  -  tan  0  ]  (320) 

ni3  ^ 

5/3 

d6  /d0  -  -  5/36  sec  '0  cos(0-  +  5/30  )[tan(0-  +  5/39  )  -  tan  0  ]  (321) 

ma  a6a6a  a 


dB  /d0  “  -  B  sec  0  cos(0o  +  0  )[tan(0o  +  0  )  -  tan  0  ] 

tD  d  d  P  3  p  d  3 


(322) 


The  derivative  of  the  measured  pairing  energy  that  appears  in  equation  (318)  is 
obtained  from  equation  (284)  for  actinide  nuclei  to  be 

dE®  .  /d0  =  3/4PpA"^^^sec"^'^^0  cos(0  -  3/49  )[tan(0  -  3/40  )  -  tan  0  ]  (322A) 

pair  a  a  p  a  p  a  a 

For  the  approximation  02  =  0  =0  ,  the  derivative  of  the  magnitude  of 

the  binding  energy  for  actinide  nuclei  that  appears  in  equation  (317)  is  ob¬ 
tained  from  equation  (257)  to  be 


dB/d0  =  sec  9  tan  0  [aA  -  2/3yA^^^  sec  ^^^0 

3  3  3  3 

-  5/36Z^a"^^^  sec^^^O  -  B(N  -  Z)^a"^ 

3 

-  3/4PPA-3/''  H. 


(323) 


From  equations  (316) 

through  (323)  it  follows 

dB  /d0  <  0 

dB/d0  >  0 

m  a 

a 

dM  /d0  >  0 

dM/dO  >  0 

m  a 

a 

For  example,  within 

the  approximation  O2  0n 

(324) 

(325) 


give  for  actinide  nuclei 

2/3  -1/3 

dM/d0^  =  sec  0^  tan  0^  -  aA  +  2/3yA  sec  0^ 


(326) 


+  5/36Z^a"^'^^  sec^^^0  +  B(N  -  Z)^a"^  +  3/4PpA~^^^  sec“^^^0  ] 

3  d 


-  dE  . 

shell  a 
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which  is  a  positive  number  because  of  Che  dominant  contribution  of  the  rest 
mass  terms. 


E.  Valley  of  Beta  Stability  for  Nuclei  with  Broken  Internal  Symmetries. 


Radioactive  beta  decays  require  that  the  complex  atomic  number  z  adjusts 
itself  so  as  to  minimize  the  binding  energy  of  a  nucleus  given  by  equation 
(236)  but  subject  to  the  constraints  represented  in  equations  (55)  and  (62).^'* 
Combining  equations  (62)  and  (236)  gives  after  neglecting  shell  and  pairing 
energy  effects 


B  =  ai  -  ya^'^  -  -  6(a  -  2z)^/i 


(i27) 


The  minimum  binding  energy  condition  is 

3B/3z|_  =  -  26z/a^^^  +  46(i  -  2z)/i  -  0 
a 


(328) 


which  gives  using  equation  (62) 

-vs  -/O'  .  — 2/3. -1 

z  =  a/2i/  TT  ca  ) 

-vs  -/0/1  .  o — 2/3.,,  — 2/3. -I 

n  =  a/2(l  +  2ca  )(T  ca  ) 


(329) 

(330) 


where  vs  =  valley  of  beta  stability,  and  where 


c  =  6/(4B) 


c  =  6/(46) 


®c  “  ®6  -  % 


(331) 


For  medium  weight  atomic  nuclei  the  following  approximation  to  equations  (329) 
and  (330)  can  be  used 


z''®  .  1/2(1  -  cl^'h 

»  1/2(1  + 


(332) 

(333) 


where  c  is  given  by  equation  (331). 

—VS 

For  heavy  nuclei  the  exact  equation  (329)  must  be  used  to  calculate  z 
Equation  (329)  can  be  written  as 


=  (G  +  jF)/D 


tan  0^®  =  F/G 
z 


,  (o2  ^  p2^1/2^j, 


(334) 

(335) 


where 


G  =  a/2  cos  0  [1  +  ca^'^  cos(0  +  2/30  )]  +  c/2  sin  0  sin(0  +  2/30  )  (336) 


F  =  a/2  sin  0  [1  +  cos(0  +  2/30  )]  -  c/2  a^^^  ■  s  0^  sin(0^  +  2/30  )  (337) 

s  c  3  a  c  d 


D  =  [1  +  ca^^^  cos(0  +  2/30  )]^  +  c^a^^^  sin^(0  +  2/30  ) 


(338) 
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Combining  equations  (46)  and  (336)  through  (338)  gives  for  actinide  nuclei 

G  -  A/2  [1  +  cA^^^  sec^^^e  cos(e^  +  2/36  )]  (339) 


+  c/2A^^^  sec^^^e  tan  0  sin(6  +  2/36  ) 
a  a  c  a 


F  -  A/2  tan  6  [1  +  sec^^^6  cos (6  +  2/36  )] 

a  a  c  a 


-  c/2A^^^  sec^^^6  sin(e  +  2/36,) 
a  c  a 


(340) 


D  =  [1  +  cA^^^  sec^^^6  cos (6  +  2/36  )]^ 

a  c  a 

+  c^A^^^  sec^^^6  sin^(e  +  2/36  ) 
a  c  a 


(341) 


If  A  and  6^  are  taken  to  be  the  known  quantities,  then  equation  (54)  involves 
five  unknown  quantities  W  ,  Z  ,  N  ,  62  and  6^^  .  The  complex  number  equation 
(54)  and  the  scalar  equation  (55)  supply  three  equations  for  determining  the 
five  unknown  quantities.  The  complex  number  valley  of  beta  stability  equation 
(328)  supplies  two  additional  equations  and  so  a  complete  solution  is  possible. 
From  equations  (335)  and  (339)  through  (341)  that  describe  the  valley  of  beta 
stability  for  the  general  case  of  nuclei  of  arbitrary  size  it  follows  that  in 
general  for  actinide  nuclei  W  =  0  and 


6^® 

z 


,vs 


=  z''®(A.e^) 


(342) 


Combining  equations  (46)  and  (355)  gives  for  actinide  nuclei 

Z  =  z  cos  0  =  G/D  (343) 

z 

=  A  -  »  A  -  G/D  (344) 

VS 

The  value  of  6^^  for  the  valley  stability  of  actinide  nuclei  is  obtained  ex¬ 
actly  from  equation  (62)  which  is  written  in  the  form  with  W  =  0 


_VS  _  -VS 

n  =  a  -  z 


(345) 


from  which  it  follows  in  accordance  with  equation  (60)  that  for  actinide  nuclei 

(346) 


tan  0^^®  =  (A  tan  6  -  z'^®  tan  6'^®)/(A  -  Z) 

n  a  z 


VS  vs 

where  and  Z  are  given  by  equations  (335)  and  (343) .  The  measured  values 
of  the  atomic  number,  neutron  number  and  atomic  mass  number  in  the  valley  of 
beta  stability  are  given  for  the  actinides  by 


vs 

z 

m 


,vs 


vs  „vs 
n  =  N 
m 


a 

m 


A 


(347) 
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which  are  integers. 

5.  SUPPRESSION  OF  LOW  ENERGY  BINARY  FISSION  OF  THE  FISSILE  ACTINIDES  BY 
AN  EXTERNAL  FIELD.  This  section  determines  the  conditions  necessary  for  the 
inhibition  of  spontaneous  or  thermal  neutron  induced  binary  fission  in  fissile 
actinide  nuclei  that  are  located  in  an  electromagnetic  or  gravitational  field. 
The  Bohr-VJheeler  fission  instability  condition  is  generalized  to  the  case  of 
actinide  nuclei  located  in  an  external  field.  This  condition  is  used  to  de¬ 
termine  the  critical  internal  phase  angle  of  the  atomic  number  which  corre¬ 
sponds  to  the  suppression  of  spontaneous  or  thermal  neutron  induced  binary 
fission  by  an  external  field.  An  expression  for  the  corresponding  critical 
static  magnetic  field  that  is  required  to  suppress  binary  fission  in  the  fissile 
actinides  is  presented  and  numerical  values  are  obtained  for  this  critical 
static  magnetic  field  In  terms  of  the  fissllity  parameters  for  several  actinide 
nuclei. 


A.  Bohr-Wheeler  Fission  Condition  for  Actinide  Nuclei  in  an 
Electromagnetic  Field. 

The  standard  Bohr-Wheeler  analysis  for  spontaneous  or  thermal  neutron  in¬ 
duced  nuclear  fission  utilizes  the  flssility  parameter  which  is  defined  by*^*^** 

X  =  (Z^/A)(ky/6)'^  (348) 

and  the  spontaneous  and  thermal  neutron  induced  fission  condition  is  written 


X  >  1 


Z^/A  ^  ky/6 


where  y  and  6  =  surface  and  Coulomb  energy  coefficients  that  appear  in  the  liq¬ 
uid  drop  nuclear  mass  formula  treated  in  Seccion  4,  and  where  theoretically  for 
spontaneous  fission 

K  =  g/h  =  2  (350) 

where  g  =  2/5  and  h  =  1/5  are  the  second  order  series  expansion  coefficients  of 
the  surface  and  Coulomb  energies  respectively  when  these  terms  are  expanded  in 
terms  of  an  ellipsoidal  deformation  parameter.  The  values  of  ic  ,  y  and  6 

along  with  the  other  mass  formula  parameters  are  determined  empirically. 

The  values  of  k  are  different  for  spontaneous  and  for  thermal  neutron  induced 
fission,  and  in  fact  k  is  dependent  on  the  energy  of  the  incident  neutrons?^  **'* 
For  thermal  neutron  induced  f ission^^~'*‘‘ 


K  -v  1.471 


(351) 


Choosing  y  =  17.2  MeV  and  6  =  0.698  MeV  yields  the  following  fission  conditions 
for  a  zero  value  of  the  externally  applied  field^®“‘*‘* 


Z  /A  >  49.28 
Z^/A  >  36.25 


spontaneous  fission 

thermal  neutron  induced  fission 


(352) 

(353) 
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These  inequalities  show  that,  loosely  speaking,  only  the  actinides  and  trans¬ 
actinides  can  undergo  spontaneous  or  thermal  neutron  induced  fission,  but  not 
all  of  these  heavy  elements  undergo  fission.  Within  this  group  of  heavy  ele¬ 
ments  the  more  neutron  rich  isotopes  tend  to  be  more  stable  against  fission, 
for  example  is  stable  against  thermal  neutron  Induced  fission  but  is 

fissile.  The  empirical  value  of  <  that  describes  thermal  neutron  induced  fis¬ 
sion  will  depend  on  the  values  selected  for  the  mass  formula  parameters  y  and 
6  .  In  general  k  can  be  taken  to  be  a  decreasing  function  of  the  kinetic  ener¬ 
gy  of  the  incident  neutrons.  The  fission  criteria  presented  above  ignore  all 
shell  structure  effects  and  are  therefore  approximate  relations  which  show  only 
general  behavior  and  for  which  counterexamples  can  always  be  found  in  the  bor¬ 
der  region  between  fissile  and  non-fissile  nuclei. 

The  generalization  of  equation  (3A9)  to  the  case  of  atomic  nuclei  located 
in  an  electromagnetic  or  gravitational  field,  which  breaks  the  symmetry  of  the 
atomic  number,  neutron  number  and  atomic  mass  number,  can  be  written  as^^ 

z^/a  >  <y/Z  (35A) 

where  z  ,  a  ,  y  and  6  are  given  by  equations  (43),  (45),  (237)  and  (238)  re¬ 
spectively.  The  fission  instability  boundary  is  given  by 

z^/a  =  Ky/Z  (355) 

or  equivalently  the  two  scalar  fission  stability  boundary  conditions  are^® 

z^/a  =  <y/'5  (356) 

6  =26  -  0  +0-  (357) 

a  z  Y  6 

Therefore  in  an  external  field  the  internal  phase  angles  of  the  atomic  number, 
atomic  mass  number,  surface  energy  coefficient  and  the  Coulomb  energy  coeffi¬ 
cient  enter  into  the  fission  instability  condition.  Equations  (356)  and  (357) 
will  be  solved  to  determine  the  critical  value  of  0^  that  is  required  to  sup¬ 
press  spontaneous  or  thermal  neutron  induced  binary  fission  in  the  actinides 
by  the  application  of  an  external  electromagnetic  field. 

B.  Critical  Value  of  the  Internal  Phase  Angle  of  the  Atomic 

Number  that  is  Required  to  Suppress  Low  Energy  Binary  Fission 
of  Fissile  Actinide  Nuclei  Located  in  an  Electromagnetic  Field. 

Combining  equation  (46)  with  equations  (356)  and  (357)  gives  the  binary 
fission  instability  boundary  for  actinide  nuclei  located  in  an  external  field 
as 

Z^/A  =  Ky/6  sec  0  sec  ^0  (358) 

sl  z 

=  ky/6  sec(20  +  0  -  6  )sec  ^0 

z  6  Y  z 

Equation  (358)  must  be  solved  for  6  .  This  can  be  done  by  noting  that  simple 

trigonometry  gives^^ 
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where 


cos(2e^  +  6.-0  )cos“^e.  -  (1  -  p^)co8(0  -  Bj,)  +  2p  8in(0  -  0.)  (359) 

z  0  y  z  y  o  y  o 


Can  0 


(360) 


Equation  (358)  can  Chen  be  written  as  a  quadratic  equation 


ap  +  bp  +  c 


(361) 


where 


a  «  cos(0^  -  0^) 

b  -  2  sin(0^  -  0^)  -  -  2  sin(0^  -  0^) 


(362) 

(363) 


c  -  (ky/6)(A/Z  )  -  cos(0^  -  0g) 


-  cos(0^  -  0^) 


(36A) 


where  the  fissility  parameter  x  is  given  by  equation  (AO).  Then  the  critical 
angle  of  the  atomic  number  for  binary  fission  suppression  0^  is  given  by 


11/2 

tan  0^  *  tan(0^  -  0^)  t  sec(0^  -  ®^)li  “  (•cy/6)(A/Z  )cos(0^  -  0^)] 

-1  1/2 

-  taa(e  -  0^)  +  sec (6^  -6^)11  -  x  *  cos(e  “  6^)]  ' 


(365) 


and  from  eqiiatlon  (357)  the  corresponding  critical  angle  of  the  atomic  mass 
number  for  fission  suppression  0|  is  given  by 


0®  =  20^  +  6.  -  e 

a  z  P  Y 


(366) 


s  s 

The  angles  0^  and  63  are  the  critical  values  of  the  phase  angles  6^  and  0^ 
that  are  required  to  suppress  thermal  neutron  induced  binary  fission  in  an  ac¬ 
tinide  nucleus.  In  other  words,  0^  >  6|  and  0^  >  of  are  required  for  binary 
fission  suppression  in  an  actinide  nucleus  by  the  presence  of  an  external 
field.  Figure  1  gives  0|  and  Figure  2  gives  0|  in  terms  of  the  fissility  pa¬ 
rameter  for  actinide  nuclei  with  the  choice  of  phase  angle  values  0y  =  O.Ar 
and  05  =  O.lr  whose  values  are  selected  only  for  demonstration  purposes. 

Equation  (365)  is  the  equation  for  the  instability  boundary  for  the 
suppression  of  thermal  neutron  induced  fission  of  an  actinide  nucleus  (Z,A), 
and  is  valid  for 


<y/6  cos(0^  -  0g)  ^  Z  /A  ^  “ 


(367) 


or  in  terms  of  the  fissility  parameter 


cos(0^  -  0g)  <  X  ^ 


(368) 
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for  X  ^  cos(9y  -  65)  as  shown  in  Figures  1  and  2. 

The  condition  for  the  suppression  of  thermal  neutron  induced  binary  fis¬ 
sion  in  the  actinides  by  an  external  electromagnetic  field  can  also  be  written 
in  an  alternative  form  to  equation  (365)  as  follows 

cos(e^  -  0^)  <  X  <  (379) 

where  x  =  fissility  parameter,  and  where 
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(3«0) 


F  -  (1  -  X^)8ec(e^  -  0^) 

\  -  I  tar  6^  -  tan(e^  -  0g)lco8(e^  -  0^)  (3*1) 

where  In  general  F  <  1  .  Equatlona  (365)  and  (379)  are  equivalent  to  the  fol¬ 
lowing  binary  fission  suppression  condition 

0,  >  0®  (382) 

z  z 

S 

where  given  by  equation  (365)  in  terms  of  x  •  Equation  (379)  gives  the 

range  of  flssillty  parameters  for  which  thermal  neutron  induced  binary  fission 
is  suppressed  by  an  external  field  which  generates  the  phase  angle  .  It  is 
clear  from  equations  (379)  and  (382)  that  in  the  presence  of  an  electromagnetic 
field  thermal  neutron  Induced  binary  fission  can  occur  only  for  actinide  nuclei 
for  which 

X  >  0^  <  6^  (383) 

Equation  (383)  is  the  Bohr-Wheeler  binary  fission  condition  for  actinide  nuclei 
in  an  external  field  for  which  F  <  1  .  The  phase  angle  62  represents  a  bound¬ 
ary  between  the  regions  of  thermal  neutron  Induced  binary  fission  and  binary 
fission  suppression  of  the  actinides  in  an  electromagnetic  field.  If  the  ex¬ 
ternal  field  is  shut  off  all  of  the  internal  phase  angles  have  zero  values  and 
A  «  0  and  F  >=  1  so  that  equation  (379)  reduces  to  the  statement  that  under  zero 
field  conditions  there  are  no  nuclei  for  which  binary  fission  is  suppressed  be¬ 
cause  the  binary  fission  suppression  range  given  by  equation  (379)  shrinks  to 
zero  length  about  x  *  1  »  and  equation  (383)  reduces  to  the  standard  Bohr- 
Wheeler  fission  condition  given  by  equation  (349). 

As  a  first  approximation  the  condition  Oy  =  65  can  be  taken  in  equation 
(366)  and  the  phase  angle  condition  for  the  suppression  of  spontaneous  or 


thermal  neutron  induced  binary  fission  in  an  external  field  is 

ef  =  20®  -v  0.760®  0®  -v  2.630®  0®  -v-  5.410®  (384) 

a  z  n  n  z  5  z 

Combining  equations  (358)  and  (384)  gives  the  approximate  binary  fission  sup¬ 
pression  boundary  for  actinide  nuclei  in  an  external  field  as 

Z^/A  =  (<y/6)(1  -  tan^0®)~^  (385) 

z 

X  =  (1  -  tan^0®)"^  (386) 

or  equivalently  as 

Z^/A  =  (ky/6)[1  -  tan^(0®/2)]‘^  (387) 

X  =  [1  -  tan^(0®/2)]"^  (388) 

3l 
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For  Che  case  of  the  approxinaclon  Qy  -  Che  condlcion  for  binary  fission  sup¬ 
pression  In  fissile  acClnlde  nuclei  by  an  exCemal  field  which  Induces  a  phase 
angle  62  In  Che  acomlc  number  Is  given  by  equaclons  (379)  and  (382)  wlch 
X  ■  Can  02  as 

1  <  (Z^/A)(<y/«)'^  <  (1  -  Can^e  )'^  6  >  e®  (389) 

z  z  z 

or  equlvalenCly 

1  <  X  <  (1  -  tan^e  )"^  6  >  0®  (390) 

z  z  z 

which  follows  dlrecCly  from  equaclons  (379)  Chrough  (381)  when  0y  -  0^  .  Equa- 
Clon  (390)  decermlnes  che  nuclei  region  of  suppressed  binary  fission  wichln  Che 
approxlmacion  0y  “  06  .  and  corresponds  Co  Che  exacC  region  of  suppressed  bi¬ 
nary  fission  ChaC  is  specified  by  equaclons  (379)  Chrough  (382).  Wichln  Che 
approxlmacion  0y  ■  06  the  condlcion  for  chermal  neucron  Induced  binary  fission 
of  the  acclnldes  In  an  elecCromagnetlc  field  Is  obcalned  from  equaclon  (390)  as 

X  >  (1  -  can^0  )"^  0  <  6^  (391) 

z  z  z 


which  follows  from  equaClon  (383)  for  Che  case  0y  =  06  •  and  which  reduces  Co 
the  standard  Bohr-Wheeler  fission  condition  given  by  equation  (349)  for  the 
case  of  zero  external  field.  Equaclons  (386)  and  (388)  determine  0|  and  0|  in 
terms  of  x  for  the  approximation  ©y  ®  65  . 


Equations  (385)  and  (386)  show  that  within  the  approximation  6y  85 
internal  phase  angle  of  the  atomic  number  that  is  required  to  suppress  chermal 
neutron  induced  binary  fission  in  an  actinide  nucleus  (Z,A)  is  given  by 


tan  0^  =  ±  [1  -  (ky/6)(A/Z^)]^^^ 

1  (1  -  x-^)^/^ 


(392) 


=  ±  [(x  -  i)/x]^^^ 


where  x  ^  1  •  As  shown  in  Figure  3  equation  (392)  has  positive  and  negative 
modes.  Equation  (392)  can  be  obtained  directly  from  the  exact  equation  (365) 
by  making  the  approximation  0y  *  06  •  The  approximation  Qy  -  05  is  made  only 
for  convenience  because  this  eliminates  the  values  of  0y  and  06  from  the  cal¬ 
culation  of  ©I  .  If  this  approximation  is  not  made  the  values  of  0y  and  06 
must  be  obtained  from  fitting  a  liquid  drop  type  of  nuclear  mass  formula  to 
measured  atomic  masses.  Equation  (392)  can  also  be  written  as 

cos  0^  «  [2  -  (ky/6)(A/Z^)]"^^^  (393) 


=  (2  -  x-S-^/2 


[(2x  -  D/xl 


-1/2 
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(394) 


sin  6®  -  ±  {[1  -  (icy/6) (A/Z^)l/[2  -  (ky/6>(A/2^)1}'^^ 

-  M(1  -  x"^)/(2  - 

-  ±  l(X  -  l)/(2x  -  1)1^^^ 

Equations  (384)  and  (392)  can  be  written  equivalently  as 

0®  -  ±  tan“^(l  -  (ky/«)(A/Z^)I^^^  6^  -x.  (2  +  Z/N)e®  (395) 

z  n  z 

-  ±  tan"^t(x  -  1)/Xl^^^ 

ef  -  ±  2  tan'^d  -  (KY/fi)(A/Z^)1^^2  qS  ^  2e®/(l  -  Z/N)  (396) 

-  ±  2  tan“^(x  -  l)/xl^^^  =  0^/(1  -  Z/N) 

a 

and  are  plotted  in  Figures  3  and  4  for  dy  >  05  for  actinide  nuclei  with  x  ^  1  • 
The  approximate  equations  (392)  through  (396)  are  valid  for  6y  *  65  and  the  fol¬ 
lowing  range  of  the  flsslllty  parameter 

1  <  X  <  “  (397) 

S 

Within  the  approximation  0^  =  0^  the  range  of  values  of  0^  for  the  positive  and 
negative  modes  is 

-  n/A  <  0®  <  ir/A  (398) 

z 

g 

as  shown  in  Figure  3  for  x  ^  1  •  Th®  range  of  values  of  0a  for  the  positive 
and  negative  modes  is 

-  x/2  <  0^  <  it/2  (399) 

a 

as  shown  in  Figure  4  for  x  ^  1  •  Equations  (398)  and  (399)  correspond  to  the 
exact  relations  given  in  equations  (373)  and  (374)  respectively.  For  the  posi¬ 
tive  angle  modes  the  ranges  of  0|  and  0|  are  within  the  approximation  0y  =  0^ 
given  by 

0  <  0^  <  Tr/4  0  <  0^  <  ti/2  (400) 

z  a 

while  for  the  negative  angle  modes 

-  Tr/4  <  0^  <  0  -  ir/2  <  <  0  (401) 

z  a 

which  correspond  to  the  general  cases  in  equations  (375)  through  (378) .  Values 
of  the  angle  6^  given  by  equation  (395)  for  various  actinide  nuclei  appear  in 
Table  1 . 

C.  Determination  of  the  Static  Magnetic  Field  Required  to  Suppress 
Thermal  Neutron  Induced  Binary  Fission  in  Actinide  Nuclei. 
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The  value  of  the  magnetic  field  required  to  suppress  binary  fission  in 
the  actinides  is  calculated  by  assuming  a  relationship  between  the  magnetic 
field  and  the  internal  phase  angle  of  the  atomic  number  that  is  required  to 
suppress  binary  fission  that  is  analogous  to  equation  (42),  so  that 

tan  e®  H  -  k"  tan  0  (402) 

6z  z  0z  z 

where  *  static  nuclear  magnetic  stiffness  coefficient,  and  where  is 
given  for  the  general  case  by  equation  (365).  Therefore  in  general 

O 

where  the  flssility  parameter  x  is  given  by  equation  (348).  If  0|  is  given  by 
the  approximate  equation  (392)  the  critical  value  of  the  magnetic  field  required 
to  suppress  binary  fission  in  the  actinides  is  given  by 

-  k“^[(x  -  l)/xl^^^  (A04) 


with  X  ^  >  so  that  within  this  approximation 

=  H^(Kq^  ,  x)  (405) 

q 

and  H  depends  on  only  two  parameters.  The  corresponding  value  of  the  static 
magnetic  induction  required  to  suppress  binary  fission  is  written  as 

=  K®  tan  6^  B  =  K®  tan  0  (406) 

tiz  z  oz  z 

=  K®^[(x  -  l)/x]^^^  (407) 

where 


"L  =  “‘'L 


where  y  =  magnetic  permeability  of  nuclear  matter, 
stiffness  coefficients  have  the  values^® 

K®  =  2.36  X  10^^  T 
0z 


Kq  =  1.88  X  10^^  coul/(m  sec) 
wz 


(408) 

The  static  nuclear  magnetic 

(409) 

(410) 


where  T  =  tesla.  The  value  of  the  magnetic  permeability  is  taken  to  be  the 
vacuum  value^® 

M  =  =  4Tr  X  10  ^  kg  m/coul^  (411) 

H  B 

A  complete  discussion  of  the  determination  of  the  coefficients  Ke2  and  K92  has 
appeared  in  the  literature. As  shown  in  equation  (390)  thermal  neutron  in- 
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duced  binary  fission  In  the  actinides  Is  suppressed  by  an  electromagnetic  field 
when  >  0"  or  equivalently  when  B  >  . 

It  has  been  sho%m  that  fissile  actinide  nuclei  in  the  presence  of  an  elec¬ 
tromagnetic  field  may  under  some  conditions  be  cooled  so  as  to  inhibit  binary 
fission  by  thermal  neutrons.  For  actinide  nuclei  in  an  electromagnetic  field 
the  internal  phase  angle  6^  of  the  atomic  number  is  an  increasing  function  of 
the  magnetic  induction  B  which  can  be  represented  by  equation  (406) .  When  the 
strength  of  the  static  magnetic  field  exceeds  a  critical  value  required  for  bi¬ 
nary  fission  suppression,  which  is  given  by  equation  (406)  for  the  general  case 
or  by  equation  (407)  within  a  simple  approximation,  the  phase  angle  8^  exceeds 
a  critical  value  6|  which  is  required  for  binary  fission  suppression  so  that 

6  >  0^  (412) 

z  z 

S 

where  6^  is  given  exactly  by  equation  (365)  and  approximately  by  equation  (395). 
The  corresponding  static  magnetic  induction  field  condition  for  binary  fission 
suppression  in  the  actinides  is  given  by 

B  >  (413) 

where  from  equation  (407)  the  following  approximate  result  can  be  used 

-  1)/X]^^^  1  <  X  <  “  (^1^) 

where  x  “  fissility  parameter  of  an  actinide  nucleus.  Values  of  the  static 
magnetic  induction  field  B^  required  to  suppress  thermal  neutron  induced  binary 
fission  appear  in  Table  1  for  various  actinide  nuclei.  The  values  of  are  in 
the  teratesla  range  which  shows  that  the  static  magnetic  field  required  for  the 
inhibition  of  fission  in  the  actinides  is  much  too  large  for  practical  purposes. 
However,  more  reasonable  results  can  be  obtained  with  a  properly  tuned  electro¬ 
magnetic  field  in  the  form  of  y  rays. 

6.  QUATERNARY  FISSION  OF  y  RAY  COOLED  ACTINIDE  NUCLEI.  This  section  sug¬ 
gests  that  y  ray  induced  quaternary  fission  can  occur  in  fissile  actinide  nuclei 
in  which  the  binary  fission  mode  has  been  suppressed  by  the  cooling  effects  of 
a  Y  ray  field.  Numerical  values  of  the  static  magnetic  induction  field  B^  re¬ 
quired  to  suppress  thermal  neutron  induced  binary  fission  in  the  actinides  were 
presented  in  Section  5,  and  were  found  to  be  too  large  for  practical  applica¬ 
tions.  This  section  gives  the  corresponding  values  of  the  dynamic  magnetic  in¬ 
duction  field  By  associated  with  the  y  rays  that  are  required  to  suppress  binary 
fission  in  the  actinide  elements.  Thermal  neutron  induced  binary  fission  will 
be  suppressed  in  the  actinides  such  as  and  ^^^Pu  when  these  nuclei  are 

cooled  by  a  properly  tuned  bath  of  ambient  y  rays.  However  each  of  the  two  sub¬ 
actinide  lobes  of  the  distorted  actinide  nucleus,  which  is  in  the  y  ray  cooled 
binary  fission  suppressed  state,  can  undergo  y  ray  catalyzed  thermal  neutron 
induced  binary  fission  with  the  result  that  the  actinide  nucleus  experiences 
quaternary  fission.  Examples  of  thermal  neutron  induced  quaternary  fission  re¬ 
actions  in  y  ray  cooled  actinides  are  given.  The  photonuclear  sum  rule  for 
these  reactions  is  evaluated.  The  resulting  clean  fission  process  for  actinide 
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nuclei  such  as  and  ^^®Pu  can  be  used  to  develop  environmentally  safe  nu¬ 

clear  power  reactors  because  the  enhanced  quaternary  fission  of  these  heavy 
elements  produces  relatively  light  nuclei  as  fission  products.  TWo  design  con¬ 
cepts  of  Y  ray  cooled  actinide  quaternary  fission  nuclear  reactors  are  presented . 

A.  Conditions  for  the  Thermal  Neutron  Induced  Quaternary 
Fission  of  y  Ray  Cooled  Actinides. 


First  the  thermal  neutron  induced  binary  fission  mode  for  the  fissile  ac¬ 
tinides  must  be  suppressed.  The  suppression  of  binary  fission  in  the  actinides 
can  be  accomplished  by  a  y  ray  field  which  is  tuned  to  the  giant  dipole  reson¬ 
ance  frequency  of  the  actinide  element  that  is  to  be  used  in  a  clean  fission 
nuclear  reactor.  The  magnitude  of  the  magnetic  induction  field  of  the  y  rays 
is  obtained  by  first  noting  that  at  the  resonance  frequency  the  dynamic  nuclear 
magnetic  stiffness  coefficient  is  given  by’® 


K 


By  ^ 

0z 


C  K® 
r  0z 


tesla 


(415) 

(416) 


where  =  dynamic  nuclear  magnetic  stiffness  coefficient  and  where^® 
o  =  3.054  X  10'^ 


(417) 


By 

Values  of  4^  and  K02  appear  in  Table  2  for  various  actinide  nuclei.  For  binary 
fission  suppression  in  the  actinides  using  y  rays  the  dynamic  magnetic  induction 
field  By  of  the  y  rays  and  the  corresponding  phase  angle  6^  of  the  atomic  number 
must  satisfy 

B  ^  0  >  0^  (418) 

Y  Y  z  z 

where 

B  =  tan  0  B®  =  tan  0®  (419) 

Y  0z  z  y  Qz  z 

S 

where  02  is  given  by  equation  (365)  or  (395) .  For  the  choice  of  the  approximate 
values  of  0^  given  by  equation  (395)  the  critical  dynamic  magnetic  induction 
field  of  the  y  rays  required  for  binary  fission  suppression  in  the  actinides  is 
given  by 


Ke;[(x  -  i)/x] 


1/2 


1  <  X  ^ 


(420) 


where  x  =  fissility  parameter  for  an  actinide  nucleus.  Table  1  gives  values  of 
By  for  selected  actinide  nuclei.  The  calculation  of  the  giant  dipole  resonance 
frequency  of  atomic  nuclei  requires  an  estimation  of  the  spring  constant  of  an 
atomic  nucleus.^®  The  values  of  the  nuclear  spring  constant  k  ,  resonance  fre¬ 
quency  f and  resonance  wavelength  Xj-  of  the  incident  y  rays  are  given  in  Table 
2  for  selected  actinide  nuclei.  For  resonant  incident  y  rays  the  critical  dy¬ 
namic  magnetic  field  strength  Hy  ,  dynamic  electric  field  strength  Ey  ,  power 
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s  s  s 

dmslty  Py  t  photon  energy  ty  ,  photon  flttx  density  ^  and  photon  nuaher  density 
ny  that  are  required  to  suppress  binary  fission  in  selected  actinide  nuclei  are 
given  in  Table  3.  The  details  of  these  calculations  are  given  in  Reference  26. 
The  Y  cooling  of  the  binary  fission  process  is  a  nuclear  analog  of  the  cool¬ 
ing  of  electrons  and  atoms  by  laser  light. 


For  a  value  of  the  y  ray  magnetic  Induction  field  By  that  satisfies  the 
suppression  condition  for  thermal  neutron  Induced  binary  Hssion  of  actinide  nu¬ 
clei  given  by  equation  (418) ,  an  actinide  nucleus  may  be  considered  to  be  com¬ 
posed  of  two  lobes  of  subactinide  nuclei  a  and  b  into  which  the  actinide  nucleus 
would  fission  were  it  not  for  the  cooling  effects  of  the  y  ray  electromagnetic 
field  (see  Figure  5).  However,  thermal  neutron  Induced  binary  fission  of  the 
component  subactlnlde  nuclei  lobes  can  occur  if  the  electromagnetic  field  of  the 
Y  rays  is  sufficiently  strong  as  to  make  the  internal  phase  angles  of  the  atomic 
numbers  of  the  component  subactlnlde  nuclei  a  and  b  larger  than  their  critical 
values  required  for  y  ray  catalyzed  thermal  neutron  induced  fission^ ^ 

e  >  0^  6  K  ^ 

za  za  zb  zb 


F  F 

where  62a  ®zb  given  exactly  by  eqiiation  (39)  or  approximately  by  equa¬ 
tion  (41)  in  terms  of  the  fissility  parameters  Xa  Xt  respectively  of  the 
two  subactlnlde  nuclei  lobes.  The  net  result  is  y  ray  catalyzed  thermal  neutron 
induced  quaternary  fission  of  fissile  actinide  nuclei  as  shown  in  Figure  5. 
Accordingly,  the  magnetic  induction  field  conditions  for  quaternary  fission  of 
Y  ray  cooled  actinide  nuclei  using  thermal  neutrons  are  given  by 


>0^ 

za 

«zb  >  <b 

0  >  0® 
z  z 

(422) 

>  B^ 
ay 

B  >  bJ^ 

Y  by 

b  ^B® 
y  Y 

(423) 

C 

where  the  critical  magnetic  field  By  required  for  y  ray  cooling  of  the  actinide 
nuclei  is  given  by  equation  (420),  and  where  equations  (41)  and  (42)  give  the 
following  approximate  expressions  for  the  critical  magnetic  induction  fields 

and  required  for  ray  catalyzed  thermal  neutron  induced  fission  of  the 
subactinide  nuclei  lobes ^ 


(424) 

(425) 


where  Xa  Xk  =  fissility  parameters  of  the  subactinide  component  nuclei  lobes 
a  and  b  .  The  two  subactinide  nuclei  lobes  of  the  distorted  binary  fission 
suppressed  actinide  nucleus  will  undergo  binary  fission  so  that  thermal  neutron 
induced  quaternary  fission  will  be  the  dominant  fission  decay  mode  of  a  fissile 
actinide  nucleus  in  a  y  ray  field  that  satisfies  the  conditions  of  equation 
(423).  The  characteristics  of  the  electromagnetic  field  required  for  the  y  ray 
catalyzation  of  thermal  neutron  induced  binary  fission  in  the  subactinide  ele¬ 
ments  has  been  treated  in  the  literature  and  will  not  be  repeated  here  for  the 
subactinide  nuclear  lobes  a  and  b  . 
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The  fission  product  nuclei  of  the  y  ray  catalyzed  quaternary  fissiui  pro¬ 
cess  are  relatively  light  nuclei  which  exhibit  low  level  beta  decays  and  are 
not  harmful  to  the  environment.  Tables  1  through  3  give  the  relevant  charac¬ 
teristics  of  the  electromagnetic  field  of  the  y  rays  that  are  required  for  the 
suppression  of  binary  fission  in  the  actinides  and  which  are  described  by  equa¬ 
tions  (392)  and  (A20) .  The  calculations  involved  in  the  preparation  of  Tables 
1  through  3  are  analogous  to  those  given  in  Reference  26  except  that  now  equa¬ 
tions  (392)  and  (420)  determine  the  basic  calculation  of  the  electromagnetic 
field  strength  required  for  binary  fission  suppression  in  the  actinides. 

6.  Examples  of  y  Rsy  Catalyzed  Thermal  Neutron  Induced 
Quaternary  Fission  of  the  Actinides. 

Consider  now  some  typical  examples  of  the  quaternary  fission  of  the  fis¬ 
sile  actinides  by  y  ray  suppression  of  the  binary  fission  of  the  parent  fissile 
actinide  nucleus  and  the  y  ray  catalyzation  of  thermal  neutron  induced  binary 
fission  of  the  subactlnlde  nuclei  lobes  of  the  parent  actinide  nucleus.  Typical 
reactions  of  this  form  require  one  neutron  for  each  subactinide  lobe,  and  there¬ 
fore  two  incident  neutrons  are  required  for  the  quaternary  fission  of  a  fissile 
actinide  nucleus.  Therefore  including  absorption  at  least  four  fission  pro¬ 
duct  neutrons  are  required  for  each  quaternary  fission  reaction  in  order  to 
have  the  sustained  fission  reactions  required  for  the  operation  of  a  nuclear 
reactor.  In  addition,  a  y  ray  photon  bath  is  required  to  suppress  the  binary 
fission  process  in  the  fissile  actinides  and  to  catalyze  the  binary  fission  of 
the  subactlnlde  lobes  of  the  y  ray  cooled  actinide  nuclei. 


Typical  y  ray  catalyzed  thermal  neutron  induced  quaternary  fission  reac¬ 
tions  for  Y  ray  cooled  fissile  actinide  nuclei  such  as  and 

will  now  be  presented 
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^As  -h  ^^^Pr  4  Jn  -h  y 


(virtual  y  ray 
cooled  lobes) 


^  ,^Be  -f-  llCu  +  ^jBe  -h  ^^^^Cs  +  4  Jn  y 


(426) 
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”Br  +  “^«^La  +  4  Jn  y 


(virtual  y  ray 
cooled  lobes) 


(432) 


“Be  -h  llGa  +  “Be  +  “«I  +  4  Jn  y 
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(433) 


-*■  3  56®*  +  4  Jn  +  Y  (y  ray  cooled  lobes)  (434) 


^jBe  +  llGe  +  ^jBe  +  ^’^Te  +  a  Jn  +  y 


(435) 
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=>  “b  +  ’?Zn  +  “b  +  “iXe  +  4  in  +  y 


(451) 
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®^Br  +  +  4  +  Y 
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(456) 


-*■  ||Sr  +  ^sgBa  +  4  Jn  +  y  (y  ray  cooled  lobes)  (457) 
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+  ^sjCs  +  4  in  +  Y  (y  ray  cooled  lobes)  (459) 


39 

9 


Be  +  5®Br  +  “fie  +  “Jsb  +  4  Jn  +  y 


(460) 
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(virtual  y  ray 
cooled  lobes) 
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(461) 

(462) 


In  this  manner  the  dangerous  fission  products  such  as  3eSr  can  be  eliminated 
from  nuclear  reactor  wastes. 


C.  Y  Ray  Cooled  Actinide  Fission  Reactors. 

The  fissile  actinide  elements  can  be  used  to  power  clean  fission  nuclear 
reactors  if  a  sufficiently  high  flux  of  y  rays  is  employed  to  cool  the  actin¬ 
ides  to  the  point  where  thermal  neutron  induced  binary  fission  is  suppressed 
and  where  y  ray  catalyzed  thermal  neutron  induced  binary  fission  can  occur  in 
the  subactinide  component  lobes  of  the  y  ray  cooled  fissile  actinides.  The 
result  is  that  y  ray  catalyzed  thermal  neutron  induced  quaternary  fission  is 
the  dominant  fission  mode  for  this  type  of  nuclear  reactor.  Tables  1  through 
3  give  the  characteristics  of  the  y  ray  fields  required  to  suppress  thermal 
neutron  Induced  binary  fission  in  selected  actinide  nuclei.  These  are  only 
approximate  results  because  the  Bohr-Wheeler  fission  condition  describes  only 
the  gross  features  of  nuclear  fission,  and  in  fact  Table  1  shows  that  an  in- 

^  9  1  S 

correct  prediction  is  made  for  U  by  predicting  that  this  nucleus  is  not 
fissile.  Figures  6  and  7  present  two  y  ray  cooled  actinide  quaternary  fission 
reactor  designs  that  can  be  used  as  clean  fission  nuclear  reactors. 
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In  order  to  Insure  clean  fission  any  design  of  a  y  ray  cooled  actinide 
quaternary  fission  reactor  must  Insure  that  all  fissile  actinide  nuclei,  such 
as  are  cooled  by  y  rays  when  thermal  neutrons  are  present,  otherwise 

ordinary  binary  fission  will  occur  and  the  nuclear  reactor  will  run  in  an  un¬ 
clean  mode.  The  problem  here  Is  that  neutrons  generally  penetrate  matter  eas¬ 
ily  except  for  special  materials  such  as  graphite  or  water,  while  y  rays  are 
readily  absorbed  In  most  materials  although  quartz  is  a  reasonable  transmitter. 
For  y  ray  catalyzed  thermal  neutron  Induced  quaternary  fission  the  y  rays  and 
the  thermal  neutrons  must  coexist  together  in  the  regions  of  a  fuel  element 
such  as  The  thermal  neutrons  must  be  prevented  from  penetrating  into  the 

by  themselves  after  the  y  rays  have  been  absorbed  otherwise  ordinary  bi¬ 
nary  fission  will  occur.  A  possible  way  to  acomplish  this  would  be  to  have 
fuel  embedded  in  quartz  which  Is  relatively  transparent  to  y  rays.  The 
neutron  source  and  the  y  ray  source  should  be  in  close  proximity  in  order  to 
insure  that  is  simultaneously  under  the  influence  of  y  rays  and  thermal 

neutrons.  Ideally  the  y  rays  should  originate  from  the  decay  of  a  radioactive 
source  mixed  with  the  ^  fuel,  however  because  the  energy  of  the  y  rays 
needs  to  be  in  a  range  that  corresponds  to  the  giant  dipole  resonance  frequency 
of  actinide  nuclei,  12-14  MeV,  it  is  likely  that  an  artificial  source  of  y  rays 
will  be  required.^® 

D.  Electric  Dipole  Sum  Rule  for  y  Ray  Cooled  Actinides. 

This  section  considers  the  effect  of  y  ray  cooling  on  the  form  of  the 
electric  dipole  sum  rule  for  actinide  nuclei  whose  fissility  parameters  satisfy 
X  ^  1  .  Consider  y  rays  and  thermal  neutrons  incident  on  actinide  nuclei  in 
which  a  y  ray  cooling  mode  is  induced  with  an  associated  depressed  binary  fis¬ 
sion  rate.  The  y  ray  energies  are  in  the  range  of  12-14  MeV  for  the  excitation 
of  the  giant  dipole  resonance  in  the  actinides.^®"’®®*'*® 

The  conventional  electric  dipole  sum  rule  for  photonuclear  reactions  is 
written  in  the  standard  incoherent  spacetime  form  as  follows^ 

G.  =  /  a  de  =  gZN/A  (463) 

me  ■'  " 

where  a  =  photonuclear  reaction  cross  section  for  incoherent  spacetime,  Z  ,  N 
and  A  =  atomic  number,  neutron  number  and  atomic  mass  number  of  the  target  nu¬ 
cleus,  and  where"*® 

g  =  ln^e^h/  (m  c)  0.06  MeV  b  (464) 

where  the  integral  is  taken  over  photon  energies  up  to  30  MeV.  The  concept  of 
the  broken  symmetry  forms  of  the  atomic  number,  neutron  number  and  atomic  mass 
number  suggests  that  a  complex  number  generalization  of  the  photonuclear  reac¬ 
tion  sum  rule  should  be  written  as 

G  =  /  o  de  =  gzn/a  (465) 

where  o  =  complex  number  photonuclear  reaction  cross  section,  G  =  complex  num¬ 
ber  integrated  photonuclear  cross  section,  and  z  ,  n  and  a  =  complex  number 
atomic  number,  neutron  number  and  atomic  mass  number  respectively  which  are 
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given  by  equations  (43)  through  (45).  The  complex  numbers  a  and  G  can  be  rep¬ 
resented  as 


a  -  o  exp(je^) 


G  -  G  exp(jep) 


(466) 


which  are  complex  numbers  in  an  internal  space.  Equation  (465)  can  be 
for  actinide  nuclei  using  equation  (46)  as 


written 


G  »  gzn/a  ■  gZN/A  sec  0^  sec  0^  sec  0^ 

0^-0  +0  -  0 
G  z  n  a 


(467) 

(468) 


Equation  (465)  can  also  be  written  as 


G  -  /  a  sec  8^^  exp[j(0^  ®e 


(469) 


-  /  oe  CSC  exp[J(0^  +  0^  +  e^p]d0^ 


(470) 


where  the  complex  number  photon  energy  is  written  as 
e  -  e  exp(j0^) 
and  where 


(471) 


tan  6  =  e30  /3e 

ee  e 


(472) 


The  component  form  of  equations  (469)  and  (470)  are  written  as 
00 

G  cos  Q„  -  (  a  sec  8  cos(0  +0  +8  )de 

G  i  ee  '  o  e  ee' 


(473) 


=  /  oe  CSC  8  cos(0  +0  +8  )d6 

•’  'a  e  ee  e 


(474) 


G  sin  e_  =  /  o  sec  8  sin(e  +0  +8  )de 

cl  cc  '  a  e  ee 


(475) 


/  oe  CSC  8  sin(0  +6  +8  )d6 

<  '  o  e  ee  e 


(476) 


which  are  generally  valid  equations.  From  equations  (467)  and  (468)  it  follows 
that  for  actinide  nuclei 

G  cos  0„  =  gZN/A  sec  0  sec  0  sec  ^0  cos(0  +0  -  0  )  (477) 

G  z  n  a  zna 


G  sin  d  =  gZN/A  sec  0  sec  0  sec  0  sin(0  +0  -  0  ) 


zna 


(478) 
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Generally  G  cos  6Q>01fZ>0,N>0  and  A  >  0  . 

The  upper  Integration  limit  of  ir/6  in  equations  (470),  (474)  and  (476) 
arises  from  the  conservation  of  momentum  for  the  photon-nucleon  interaction 
which  can  be  written  as 

e/c  -  hv/c  ■  mv  (479) 

where  the  complex  number  photon  frequency  v  and  the  complex  number  nucleon  ve¬ 
locity  V  are  written  as 

V  -  V  exp(je^)  V  =  V  exp  (jS^)  (480) 


Equations  (479)  and  (480)  can  be  written  as 

e/c  “  hv/c  =  mv  e*e“e»e-e  (48i) 

e  V  V  X  t 

which  are  valid  for  the  inelastic  photonuclear  reaction.  For  coherent  space- 
time  =  Tr/3  ,  0(;  =  ir/b  and  0y  «=  Tr/6  ,  so  that  0e  “  ii/6  for  Inelastic  photonu¬ 
clear  reactions  in  coherent  spacetime. 

For  nuclei  whose  fissility  parameters  are  greater  than  unity,  x  ^  1  »  and 
which  have  been  cooled  to  the  point  of  the  suppression  of  thermal  neutron  in¬ 
duced  binary  fission  by  an  external  y  ray  field,  the  critical  internal  phase 
angles  of  the  atomic  number,  neutron  number  and  atomic  mass  number  associated 
with  binary  fission  suppression  are  related  by  equation  (384)  as  follows 

0®  =  20®  -v  0.760®  0®  'V  2.630®  (482) 

a  z  n  n  z 


and  therefore  equations  (467),  (468)  and  (482)  give  approximately 

G  'v/  gZN/A  (1  -  tan^0®)(4  cos^0^  -  3)"^ 

z  z 


0. 


(483) 

(484) 


From  equations  (475)  and  (484)  it  follows  that  for  incipient  fission  spacetime 
is  incoherent  and 


0 

a 


=  0 


0=0 

e 


(485) 


so  that  the  photonuclear  interaction  for  the  case  of  fission  must  be  scalar 
(©o  “  0)  with  incoherent  photon  interactions,  and  equations  (473)  and  (483) 
through  (485)  become 

G  =  /  a  d£  -v  gZN/A  (1  -  tan^0®)(4  cos^0®  -  3)"^  (486) 

o  z  Z 

However  it  has  been  shown  in  equation  (392)  that  the  approximate  condition  for 
y  ray  induced  binary  fission  suppression  is  given  by 
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s 

tan  6 

z 


[(X  -  i)/x] 


(487) 


1/2 


where  x  ^  1  is  the  flssllity  parameter.  Combining  equations  (486)  and  (487) 
gives  for  incoherent  photon  interactions  with  the  actinides  and  for  x  1 

/  o  de  -v  gZN/A  x"^(2x  -  DO  -  2x)"^  (488) 

o 

and  therefore  the  fissility  parameter  enters  into  the  electric  dipole  sum  rule 
that  includes  catalyzed  thermal  neutron  Induced  quaternary  fission  of  the  ac¬ 
tinides  which  are  immersed  in  a  y  tay  field  that  has  suppressed  binary  fission. 
The  evaluation  of  the  electric  dipole  sum  for  the  actinides  is  done  only  approx¬ 
imately  and  is  valid  only  for  small  internal  phase  angles  or  equivalently  equa¬ 
tion  (488)  is  valid  only  for  x  '''  1  • 

7.  FINAL  STATE  ENERGY  CONDITIONS  FOR  THE  BINARY  FISSION  OF  THE  ACTINIDES 
IN  AN  EXTERNAL  FIELD.  This  section  considers  the  binary  fission  of  actinide 
nuclei  in  an  electromagnetic  or  gravitational  field  that  is  not  strong  enough 
to  suppress  the  -  Lnary  fission  process*  so  that  according  to  equation  (418)  the 
condition  that  describes  this  is  02  <  82  or  By  <  .  The  fission  products  are 

subactlnlde  nuclei  and  neutrons.  A  comparison  is  made  between  the  initial  and 
final  energy  states  of  an  actinide  nucleus  that  has  undergone  binary  fission  in 
the  presence  of  an  external  electromagnetic  or  gravitational  field.  A  fission 
reaction  in  which  a  nucleus  (Z,A)  has  split  into  two  nuclei  (Z]^,Ai)  and  (Z2tA2) 
is  written  in  the  form^^"'*'* 

(Z.A)  (Zj  .  Aj)  +  (Z2  ,  A2)  (489) 

Then  the  nucleus  (Z2  >  A2)  is  assumed  to  eject  a  neutron 

(Z2  .  A2)  (Z2  ,  A2  -  1)  +  (0.1)  (490) 

where  in  this  notation  (0,1)  is  a  single  neutron.  In  this  way  the  general  pro¬ 
cess  of  nuclear  fission  can  be  represented  by  a  nuclear  transformation  of  the 
general  form  given  in  equation  (489).  In  an  external  field  the  nuclei  repre¬ 
sented  in  equation  (489)  are  also  associated  with  complex  atomic  numbers,  neu¬ 
tron  numbers  and  atomic  mass  numbers  that  in  analogy  to  equations  (43)  through 
(45)  for  actinide  nuclei  and  equations  (8)  through  (10)  for  subactinide  nuclei 


are  given  by 

z  =  z  exp(i0  )  =  Z  sec  0  exp(j0  )  (491) 

z  z  z 

n  =  n  exp(j0  )  =  N  sec  8  exp(j0  )  (492) 

n  n  n 

a  =  a  exp(j0  )  =  A  sec  0  exp(j0  )  (493) 

3  a  3L 

=  z^  exp(j0^P  =  Zj  cos  0^^  exp(j0^j)  (494) 

iij  »  n^  exp(j0^P  =  N^  cos  0^^  exp(j0^j)  (495) 

“  a^  exp(j0^P  =  Aj  cos  0^^  exp(j0^j)  (496) 


632 


*2  “  *2  **P^^®z2^  "  ^2  ®*2 

02  -  n2  exp(j0^2)  ”  **2  ‘^°®  ®n2  **P^J®n2^ 

®2  "  ®2  ®*P(J®a2^  “  ^2  ‘'°®  ®a2  **P<J®a2^ 


(497) 

(498) 

(499) 


The  determination  of  the  energy  released  during  the  fission  reaction  given  in 
equation  (489)  requires  that  all  of  the  nine  internal  phase  angles  that  appear 
in  equations  (497)  through  (499)  be  determined,  and  the  procedure  for  doing 
this  will  now  be  given. 

A.  Determination  of  the  Internal  Ftiase  Angles  of  the  Atomic  Number, 
Neutron  Number  and  Atomic  Mass  Number  for  the  Initial  and  Final 
States  of  Fission  of  the  Actinides  in  an  Electromagnetic  Field. 

The  nuclei  involved  in  the  fission  reaction  given  by  equation  (489)  are 
subject  to  the  following  scalar  baryon  number  conservation  equations^ ^ 


A  =  Z  +  N 

Aj  =  Zj  +  Nj 

A2  -  Z2  +  N2 

(500) 

A  =  Aj  +  A2 

z  =  Zl  +  Z2 

N  -  N^  +  N2 

(501) 

In  an  external  field  the  nuclei  represented  in  equation  (489)  are  subject  to 
the  following  complex  atomic  number,  neutron  number  and  atomic  mass  number 
conservation  equations  similar  to  equation  (54)^® 


a  +  W  =  z  +  n 


ai  +  Wi  =  Zf  +  nj 


a  +  W 


ii  +  32  z  +  -  Zj  +  Z2 


32  +  W2  =  Z2  +  n2 


n  +  W  »  n,  +  n^ 
n  1  2 


(502) 

(503) 


Equations  (502)  and  (503)  show  that  all  of  the  W’s  are  not  independent,  and 
in  fact  they  are  subject  to  the  following  equation 


26 


W-W,  -W-^W  -W  -W 
1  2  a  z  n 


(504) 


Equations  (502)  and  (503)  can  be  combined  with  equations  (491)  through  (499) 
to  yield  the  following  twelve  equations  that  are  valid  for  the  actinide  ele¬ 
ments  and  their  subactlnlde  fission  product  nuclei 


A  +  W  =  Z  +  N 

A  tan  9  =  Z  tan  9  +  N  tan  0 

a  z  n 

Ai  cos^e^^  +  «!  =  cos^e^^  +  Nj  cos^e^^ 

A,  cos  0  ,  sin  0  ,  =  Z,  cos  0  ,  sin  0 

1  al  al  1  zl 


,  +  N,  cos  0  ,  sin  0  , 
zl  1  nl  nl 


(505) 

(506) 

(507) 

(508) 
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*2  “*^2  +  "2  ■  ^2  “*^*.2  *  "2  ‘=°*^*n2  <50’) 

A2  COS  6^2  "  ^2  ®*2  ®s2  **2  ®n2  ®n2  (510) 


A  +  W  - 
a 

A,  cos^0  ,  + 
1  al 

Az  cos^0^2 

(511) 

A  tan  0 

a 

"  Aj  cos  0^j 

sin  0^j  +  A2 

cos 

«a2 

sin 

«a2 

(512) 

Z  +  W  - 
z 

Z,  COS^0  ,  + 
1  zl 

Zz  cos\2 

(513) 

Z  tan  0 

z 

*  Z,  cos  0  , 

1  zl 

sin  0^^  +  Z2 

cos 

®z2 

sin 

«z2 

(514) 

N  +  W  - 
n 

"l 

Nz  005^0^2 

(515) 

N  tan  0 

n 

■  "1  ®nl 

sin  0^^  +  N2 

cos 

®n2 

sin 

®n2 

(516) 

where  Z  ,  N  ,  A  ;  Z]^  ,  ,  A|^  and  Z2  •  N2  >  A2  are  known  quantities. 

There  are  fifteen  unknown  quantities  in  the  problem  of  the  fission  of  an 
atomic  nucleus  in  the  presence  of  an  electromagnetic  field: 


*  «z  *  «n  •  «a 

(517) 

'^l  ’  ®zl  ’  ®nl  ’  ®al 

(518) 

^2  ’  ®z2  ’  ®n2  ’  ®a2 

(519) 

«z  •  '^n  •  '^a 

(520) 

There  are  fifteen  equations  to  determine  these  quantities  and  they  are:  the 
twelve  equations  (505)  through  (516),  the  two  fission  stability  equations 
(356)  and  (357)  which  determine  0^  and  0^  in  the  forms  of  equations  (365)  and 
(366),  and  finally  equation  (504)  which  relates  the  various  W-functions.  The 
values  of  the  W's  for  actinide  nuclei  and  their  subactlnlde  nuclei  fission 


products  are  obtained  from  equations  (505)  through  (516)  to  be 

W  =  0  (521) 

Wj  =  -  Aj/2[1  +  (1  -  4fjp^^^]  +  Zj  cos^0^j  +  Nj  cos^0^^  (522) 

W2  -  -  A2/2[1  +  (1  -  4f^2)^^^l  +  ^2  cos^0^2  ^2 

-  -  Z  +  Zj  cos^0^j  +  Z2  cos^®jj2 

-  -  N  +  Nj  ‘^°®^®nl  *^2 
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-  -  A  +  A,  co«2e^j+  Aj  ^om\^ 


where 


‘wl  ■  •‘"  «zl  “•  «zl  "l  «nl  ®nl> 

*M2  ■  *2'<^2  ®z2  ®z2  *  "2  ®n2  *1.2' 

and  where 

cos^e^j  -  i/2[i  +  (1  - 

cos^e^2  “  1/2[1  +  (1  - 


(526) 

(527) 

(528) 

(529) 

(530) 


tan 

0 

z 

-  Zj/Z 

cos 

«zl 

sin 

®zl 

+  z^/z 

cos 

«z2 

sin 

®z2 

(531) 

tan 

0 

n 

-  Nj/N 

cos 

«nl 

sin 

®nl 

+  N2/N 

cos 

«n2 

sin 

®n2 

(532) 

tan 

0 

a 

*  Aj/A 

cos 

«al 

sin 

®al 

+  A^/k 

cos 

«a2 

sin 

®a2 

(533) 

Equations  (521)  through  (526)  can  be  rewritten  as 

W  =  0  (534) 

W,  =  -  A,  cos^e  ,  +  Z,  cos^e  ,  +  N,  cos^e  ,  (535) 

1  1  al  1  zl  1  nl 

2  2  2  2 

=  Z, (cos  0  ,  -  cos  0  ,)  +  N, (cos  0  ,  -  cos  0  ,) 

1  zl  al  1'  nl  al 

W2  ■  -  A2  cos^0^2  ^2  co®^®22  **2  ^°®^®n2  (536) 

2  2  2  2 

■  Z2(cos  0^2  “  cos  0^2)  N2(cos  0^2  “  cos  0^2) 


-  -  Zj  Sln^e^,  -  Zj  (537) 

W  -  -  N,  sln^0  ,  -  N„  sln^0  .  (538) 

n  1  nl  2  nz 

“a  "  -  Al  sin^^al  "  A2  sln^0^2 

Therefore  for  actinide  nuclei 

W=0  Wj>0  W2>0  (540) 

W<0  W<0  W<0  (541) 

z  n  a 
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For  the  case  of  fission  of  actinide  nuclei  in  an  electromagnetic  field  only 
W  =  0  while  the  other  W  functions  have  nonzero  values. 

B.  Energy  Released  from  Nuclear  Fission  in  an  External  Field. 

The  Q  value  of  a  nuclear  reaction  is  a  measure  of  the  energy  released  in 
a  nuclear  fission  process.^®  In  this  paper  a  complex  number  generalization  of 
the  standard  definition  of  the  Q  value  is  given  by^® 

Q/c^  =  M(A.Z)  -  M(A^.Z^)  -  M(A2,Z2)  (542) 


where  as  in  equation  (294)^® 

M(A,Z)  =  nm  +  zm„  -  B(A,Z)/c^ 
n  n 

«  -  2 

M(ApZj)  -  n^m^  +  z^m^  -  B(ApZj)/c 

_  -  2 

M(A2,Z2)  =  n2m^  +  Z2mj^  - 

Then  the  Q  value  can  be  written  as 

-  _  _  _  _  2 
Q  =  [(n  -  n^  -  n2)m^  +  (z  -  -  Z2)m^]c 

+  B(A^,Z^)  +  B(A2,Z2)  -  B(A,Z) 


(543) 

(544) 

(545) 


(546) 


Using  equation  (503)  allows  equation  (546)  to  be  written  as 

Q  =  +  Q2  (547) 

where 

^1  ^  ”  ^'^n^n  (548) 

Q2  =  B(A^,Zj)  +  B(A2.Z2)  -  B(A,Z)  (549) 


where  and  Wj^  are  given  by  equations  (537)  and  (538)  respectively. 

Because  Wn  <  0  and  <  0  it  follows  that  for  actinide  nuclei 

Qj  >  0  (550) 

The  value  of  arises  from  the  rest  mass  terms  in  equations  (543)  through 
(548)  .  The  actual  rest  mass  is  unchanged  in  a  nuclear  fission  process  because 

Nm^  4-  Zm^  -  (N^m^  +  Zj^m^^)  -  (N2ni^  +  ^2™H^  ~  ^  (551) 

which  is  always  true  because  of  the  absolute  validity  of  baryon  number  conser¬ 
vation  which  for  the  present  case  is  written  as 
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Z  -  Zj  +  Z2  N  -  Nj  +  (552) 

A  finite  value  of  Qj  results  from  the  special  form  of  the  conservation  law  of 
complex  baryon  numbers,  which  for  the  complex  atomic  number,  neutron  number 
and  atomic  mass  number  are  given  in  equations  (502)  and  (503) .  The  expression 
for  Qj  for  actinide  nuclei  can  be  rewritten  using  equations  (537),  (538)  and 
(548)  as 

Qj/c^  =  m^(Nj  sin^e^j  +  N2  sin^e^2)  +  ^2 

In  general  Qj^  >  0  for  actinide  nuclei.  For  zero  value  of  the  applied  external 
field  =  0  because  all  Internal  phase  angles  have  zero  values,  and  therefore 
Wn  =  0  and  W2  =  0  . 

The  value  of  Q2  can  be  calculated  by  combining  equations  (236)  and  (549) . 
This  Is  easily  done  for  symmetric  fission  and  under  the  approximation 


0  ,  0  -  0 
zl  z2  z 


e  ,  6  -  -v-  0 

nl  n2  n 


For  symmetric  fission  equation  (549)  becomes 


Q2  =  2B(A/2,Z/2)  -  B(A,Z) 


(554) 


(555) 


Under  these  assumptions  the  value  of  Q2  Is  given  by  the  following  complex  num¬ 
ber  generalization  of  the  standard  scalar  result^ ^ 


Q2  =  (1  -  +  (1  -  2"^^^)6i^/a^^^ 

=  -  0.2695^^^  +  0.376z^/a^^^ 


(556) 


The  simple  form  in  equation  (556)  results  from  the  approximation  given  in  equa¬ 
tion  (554) .  The  value  of  Q  is  then  written  as 


Q  =  Qj  -  0.26=73^^^  +  0.376z^/a^^^ 


(557) 


The  measured  value  of  Q  for  the  actinides  is  given  by  the  real  part  of  equation 
(557) 

Q  =  Q  -  0.26yA^^^  sec^^^e  cos(e  +  2/30  )  (558) 

m  l  a  Y  a 

+  0.376Z^A"^^-  s€G?0  sex:~-^^-^e--cos-(0-;^  +  20  -  1/30^) 

z  a  o  z  a 

Equation  (558)  can  be  compared  to  the  conventionally  calculated  value  of  Q 
which  is  given  by^^ 

+  0.376Z^A"^'^^  (559) 


For  the  case  of  zero  external  field  equation  (558)  reduces  to  equation  (559). 


A  condition  that  determines  the  possibility  of  the  final  fission  state 
to  occur  can  be  obtained  from  the  Q  value  for  the  nuclear  fission  process. 

The  complex  number  generalization  of  this  condition  is 

Q  E*  (560) 

where  E*  *  complex  number  Coulomb  potential  energy  of  two  spherical  nuclei 
(Z/2,A/2)  in  geometrical  contact.  This  Coulomb  energy  can  be  written  as  a 
simple  complex  number  generalization  of  the  standard  scalar  result^ ^ 

E*  -  l/2e^(z/2)^/[b(i/2)^^^]  (561) 

=  2^^^(l/8)(5/3)6i^/i^^^  =  0.2626z^/i^^^ 

where  as  before  in  equation  (264) 

6  =  3/5e^/b  =  0.863/b  =  (0.863/1 .523)k^  MeV 

where  b  =  complex  number  radius  parameter  given  by  equation  (226) . 

=  1.35  fm“  as  in  equation  (267)  it  follows  that 

6  =  0.765  MeV  =  765  keV 

and  0^  is  given  by  equation  (266)  . 

Combining  equations  (557),  (560)  and  (561)  gives  the  final  state  fission 
energy  condition  as 

-  +  0.26Ya^^^  =  0.116i^/i^^^  (564) 

This  equation  can  be  used  instead  of  the  incipient  fission  condition  given  in 
equation  (355)  to  determine  ©z  and  ©g  •  However  because  of  the  presence  of  the 
functions  and  Wz  the  full  set  of  thirteen  equations  (504)  through  (516) 
must  be  solved  in  conjunction  with  the  two  components  of  equation  (564)  which 
are  for  the  actinide  elements 

-  Qj  +  0.26yA^^^  sec^^^e^  cos(6^  +  2/30^)  (565) 

=  0.116Z^A"^'^^  sec^e  sec"^^^©  cos(0-  +  2©  -  1/3©  ) 

z  a  6  z  a 

0.26yA^^^  sec^'^^e  sin(0  +  2/38  )  (566) 

a  Y  a 

2  -1/3  2  -1/3 

=  0.116Z  A  '  sec  e  sec  '  8  sinO-  +  2©  -  1/30  ) 

z  a  o  z  a 

If  Wjj  and  are  neglected  in  equation  (564)  so  that  =  0  ,  then  the  final 
state  fission  condition  can  be  written  as 

z^/i  =  k'y/5  k'  =  2.36  (567) 


(562) 

For 

(563) 
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Equation  (567)  is  the  same  form  as  the  Incipient  fission  condition  given  in 
equation  (355)  and  the  same  form  of  solution  for  e|  and  sf  that  appears  in 
equations  (365)  and  (392)  can  now  be  used  to  determine  these  phase  angles  for 
the  final  state  fission  condition.  The  condition  for  binary  fission  in  an 
external  field  is  then  given  by  equation  (383).  Then  the  remaining  thirteen 
equations  (50A)  through  (516)  can  be  used  to  calculate  the  remaining  thirteen 
functions  listed  in  equations  (517)  through  (520). 

8.  CONCLUSION.  A  method  of  obtaining  clean  fission  nuclear  power  from 
actinide  elements  has  been  proposed  that  is  based  on  the  idea  of  using  a  y  ray 
field  to  suppress  thermal  neutron  induced  binary  fission  in  the  fissile  actin¬ 
ides  and  of  using  the  same  y  ray  field  to  catalyze  thermal  neutron  induced  bi¬ 
nary  fission  in  the  two  subactinide  lobes  of  the  distorted  y  ray  cooled  actin¬ 
ide  nuclei.  In  this  way  a  thermal  neutron  induced  quaternary  fission  process 
can  occur  in  y  ray  cooled  actinides.  The  net  result  is  that  the  fission  pro¬ 
duct  nuclei  for  quaternary  fission  of  y  ray  cooled  actinides  are  smaller  and 
less  radioactive  than  the  fission  product  nuclei  of  conventional  nuclear  fis¬ 
sion  reactions. 
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Table  1.  Electroouignetlc  Field  Characteristics  for  the  7  Ray 
Suppression  of  Binary  Fission  in  the  Actinides 


Nucleus 

zVa 

X 

n 

(degrees) 

bS 

(10“  T) 

(10®  T) 

“*Unp 

42.08 

1.1608 

20.41 

8.77 

10.59 

«iUnq 

41.44 

1.1432 

19.49 

8.34 

10.08 

40.80 

1.1256 

18.47 

7.87 

9.52 

“•No 

40.17 

1.1081 

17.35 

7.36 

8.91 

«*Md 

39.54 

1.0907 

16.09 

6.80 

8.23 

«’F!n 

38.91 

1.0734 

14.65 

6.16 

7.46 

“2Es 

38.89 

1.0729 

14.61 

6.14 

7.47 

251cf 

38.26 

1.0555 

12.92 

5.41 

6.57 

38.09 

1.0508 

12.40 

5.18 

6.32 

2**Cin 

37.77 

1.0419 

11.35 

4.73 

5.78 

37.31 

1.0293 

9.58 

3.98 

4.85 

2«Am 

37.14 

1.0245 

8.79 

3.65 

4.46 

228y 

37.12 

1.0241 

8.72 

3.62 

4.47 

239PU 

36.97 

1.0199 

7.95 

3.29 

4.04 

23*Np 

36.96 

1.0196 

7.89 

3.27 

4.02 

2*2pu 

36.51 

1.0072 

4.83 

1.99 

2.44 

36.49 

1.0067 

4.66 

1.92 

2.36 

233^ 

36.33 

1.0021 

2.62 

1.08 

1.33 

36.32 

1.0020 

2.56 

1.05 

1.31 

228pa 

36.32 

1.0019 

2.49 

1.03 

1.27 

23*u 

36.17 

0.9978 

0 

0 

0 

235u 

36.02 

0.9936 

0 

0 

0 

231pa 

35.85 

0.9889 

0 

0 

0 

227Th 

35.68 

0.9844 

0 

0 

0 

238u 

35.56 

0.9810 

0 

0 

0 

22 6t^ 

35.53 

0.9800 

0 

0 

0 

232Th 

34.91 

0.9631 

0 

0 

0 

22^Ac 

34.89 

0.9626 

0 

0 

0 
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Table  2.  Electroouignetlc  Field  Characteristics  for  the  7  Ray 
Suppression  of  Binary  Fission  in  the  Actinides 


Nucleus 

io»rr 

K|J 

k 

fr 

(10*  T) 

(10«  N/m) 

(10“  Hz) 

(f«) 

«2unp 

1.207 

2.846 

3.022 

99.18 

2“Unq 

1.208 

2.848 

1.998 

3.026 

99.06 

260Lr 

1.209 

2.850 

1.996 

3.030 

98.93 

1  ««No 

1.210 

2.851 

1.992 

3.034 

98.80 

«aMd 

1.211 

2.854 

1.990 

3.038 

98.67 

1.211 

2.855 

1.988 

3.042 

98.55 

"2es 

1.215 

2.865 

1.974 

3.062 

97.90 

251cf 

1.216 

2.866 

1.969 

3.066 

97.77 

1.219 

2.874 

1.961 

3.082 

97.25 

1.222 

2.880 

1.953 

3.095 

96.86 

1.219 

2.874 

1.961 

3.082 

97.25 

1.223 

2.882 

1.951 

3.099 

96.72 

2280 

1.236 

2.913 

1.910 

3.166 

94.69 

M«Pu 

1.226 

2.890 

1.940 

3.116 

96.19 

»*Np 

1.230 

2.900 

1.927 

3.138 

95.51 

2*»Pu 

1.223 

2.884 

1.948 

3.103 

96.59 

“^Np 

1.228 

2.894 

1.934 

3.125 

95.92 

238u 

1.231 

2.902 

1.923 

3.143 

95.38 

223Th 

1.240 

2.924 

1.896 

3.189 

93.99 

“®Pa 

1.236 

2.913 

1.910 

3.166 

94.69 

23*0 

1.230 

2.900 

1.927 

3.138 

95.51 

1  2350 

1.229 

2.898 

1.929 

3.134 

95.65 

1  231pa 

1.233 

2.907 

1.918 

3.152 

95.10 

1  “^Th 

1.237 

2.915 

1.907 

3.170 

94.55 

1  23Bo 

1.227 

2.892 

1.937 

3.121 

96.06 

1  228<p^ 

1.236 

2.913 

1.910 

3.166 

94.69 

1  232<p^ 

1.232 

2.904 

1.921 

3.147 

95.24 

1  “’Ac 

1.237 

2.^15 

1.907 

3.170 

94.55 

Table  3.  Electromagnetic  Field  Characteristics  for  the  7  Ray 
Suppression  of  Binary  Fission  in  the  Actinides 


Nucleus 

(10*  amp/n) 

Ef 

(10^2  volts/a) 

n 

(10”  W/l||2) 

(MeV) 

(10**  B-*sec-*) 

(10«  ■->) 

*®*Unp 

8.43 

3.18 

1.34 

12.50 

0.61 

2.03 

“‘Unq 

8.02 

3.02 

1.21 

12.52 

0.56 

1.86 

260Lr 

7.58 

2.86 

1.08 

12.53 

0.50 

1.67 

«9No 

7.09 

2.67 

0.95 

12.55 

0.44 

1.47 

258Md 

6.55 

2.47 

0.81 

12.57 

0.38 

1.27 

257Fm 

5.94 

2.24 

0.67 

12.58 

0.31 

1.03 

252Es 

5.94 

2.24 

0.67 

12.67 

0.31 

1.03 

251Cf 

5.23 

1.97 

0.52 

12.68 

0.24 

0.80 

2«Bk 

5.03 

1.90 

0.48 

12.75 

0.22 

0.73 

2**cm 

4.60 

1.73 

0.40 

12.80 

0.19 

0.63 

2"Cm 

3.86 

1.46 

0.28 

12.75 

0.13 

0.43 

2«Ain 

3.55 

1.34 

0.24 

12.82 

0.11 

0.37 

228u 

3.56 

1.34 

0.24 

13.10 

0.11 

0.37 

239pu 

3.21 

1.21 

0.19 

12.89 

0.09 

0.30 

3.20 

1.21 

0.19 

12.98 

0.09 

0.30 

2*2p^ 

1.94 

0.73 

0.07 

12.84 

0.03 

0.10 

1.88 

0.71 

0.07 

12.93 

0.03 

0.10 

233u 

1.06 

0.40 

0.02 

13.00 

0.01 

0.03 

223Th 

1.04 

0.39 

0.02 

13.19 

0.01 

0.03 

228pa 

1.01 

0.38 

0.02 

13.10 

0.009 

0.03 

234u 

0 

0 

0 

12.98 

0 

0 

235u 

0 

0 

0 

12.96 

0 

0 

231pa 

0 

0 

0 

13.04 

0 

0 

227Th 

0 

0 

0 

13.11 

0 

0 

238u 

0 

0 

0 

12.91 

0 

0 

228Th 

0 

0 

0 

13.10 

0 

0 

232Th 

0 

0 

0 

13.02 

0 

0 

0 

0 

0 

13.11 

0 

0 
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Figure  3.  The  fission  angles  and  versus  Che 
fissilicy  paraaecer  x  for  the  case  -  $t  . 


Figure  4.  The  fission  angles  #J  and  #*  versus  the 
fissilicy  paraaecer  x  for  the  case  . 


Figure  S.  Schenatlc  representation  of  the  Figure  6.  Design  concept  for  a  7  ray  cooled 
quaternary  fission  of  a  7  ray  cooled  actinide  quaternary  fission  nuclear  reactor 
actinide  nucleus. 


Figure  7 .  Design  concept  for  a  spherical  7  ray 
cooled  actinide  quaternary  fission  nuclear  reactor. 
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Abstract 

This  paper  proposes  a  scaling  and  squaring  geometric  series  method  along  with  the 
inverse-geometric  series  method  for  finding  the  discrete-time  (continuous-time)  structured 
uncertain  linear  model  from  the  continuous-time  (discrete-time)  structured  uncertain  linear 
systems.  Above  methods  allow  the  use  of  well-developed  theorems  and  algorithms  in  the 
discrete-time  (continuous-time)  domain  to  indirectly  solve  the  continuous-time  (discrete- 
time)  domain  problems.  Moreover,  these  methods  enhance  the  flexibility  in  modelling 
and  control  of  a  hybrid  composite  system.  It  has  shown  that  the  commosily  used  Inlinear 
approximate  model  is  a  specific  class  of  the  proposed  geometric  scries  modd. 

1.  Introduction 

Most  of  the  practical  processes  comprise  of  uncertain  plants.  The  uncertainty  about 

the  plant  arises  &om  unmodelled  dynamics,  parameter  variations,  sensor  noise,  input  sig¬ 
nal  level  constraints,  etc.  Therefore,  the  real  physical  processes  should  be  represented  by  a 
continuous-time  and/or  discrete-time  uncertain  framework.  For  digital  simulation,  param¬ 
eter  identification,  hybrid  control  design  and  digital  implementation  of  an  uncertain  linear 
system,  it  is  essential  to  convert  a  continuous- time  (discrete-time)  uncertain  linear  sys¬ 
tem  to  an  equivalent  discrete-time  (continuous-time)  uncertain  linear  model.  The  model 
conversions  of  a  nominal  continuons-time  (discrete-time)  linear  system  to  an  equivalent 
nominal  discrete-time  (continuous-time)  linear  model  have  been  reported  in  the  literature 
[1,2,3].  However,  the  method  for  model  conversions  of  uncertain  linear  state-space  models 
has  not  yet  been  fully  developed. 
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Ezzine  and  Johnson  [4]  used  a  classical  perturbation  method  [5]  to  convert  a  continuous- 
time  uncertain  linear  system  to  an  equivalent  discrete-time  uncertain  model  but  they  did 
not  solve  the  inverse  of  the  problem.  Shieh  et  al.[6]  utilized  the  bilinear  and  inverse-bilinear 
transform  method  to  carry  out  the  model  conversions.  In  this  paper,  we  use  a  scaling  and 
squaring  geometric  series  method  to  perform  the  model  conversions.  We  show  that  the 
proposed  method  significantly  improves  the  accuracy  of  the  existing  models  obtained  by 
using  the  bilinear  transform  method  [6]. 


2.  Problem  Formulation 

Consider  a  structured  continuous-time  uncertain  linear  system 


Xc{t)  =  Axc{t)  +  Buc{t), 
Veit)  =  C'o®c(0 


*c(0)  =  *c 


where 


A  —  Ao  +  AA  —  Ao  +  ^ 


B  =  Bq  -f  AB  =  Bo  +  ^  ^  AiiBi  (1^) 

»=i 

Xc{t)  €  is  the  state,  Uc{i)  €  is  the  input,  yc(0  €  is  the  output, 

(>1o,SojC'o)  are  nominal  system  matrices,  {AA,AB)  are  perturbed  uncertain  matrices, 
{Ai,Bi)  is  the  pair  of  known  constant  matrices,  ( Aoi,  Ahi)  is  the  pair  of  uncertain  scalar 
parameters.  Without  loss  of  generality,  we  can  assume  that  lAoj]  <  1  for  i  =  1,2,  •  •  • ,  fco> 
and  |Aii|  <  1  for  t  =  1, 2,  •  •  • ,  fcj,. 

The  associated  discrete-time  uncertain  linear  model  for  (1)  is 

Xi{kT  +  T)  =  Gxi{kT)  -h  Hui{kT),  x^{0)  =  x^o  (2o) 


yd{kT)  =  Cox,(fcT) 


where 


G  = 


fT 

H=  -h  AB)dT  =  {G-  I^){Ao  +  AA)-'{Bo  +  AB) 

Jo 

=  Uc{kT)  for  kT  <t  <{k  +  l)r 
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T  is  the  sampling  period  and  is  an  n  x  n  identity  matrix.  It  is  noted  that  and 
are,  in  general,  not  commutative  and  G  and  H  contain  nonlinear  uncertainty  terms 
in  AA  and  (AA,  A^),  respectively.  In  order  to  find  the  counterpart  of  the  representation 
in  (1)  from  (2),  we  linearize  the  respective  G  in  (2c)  and  H  in  (2d)  in  the  following  manner. 

G  —  Go  "I"  AG  =  Go  +  ^  ^  AsjGj  (3^) 

»=i 

H  ^Ho  +  AH  =  ^kiHi  (36) 

»=i 

where  Go  =  and  Hq  =  [Go  —  /„]A^*So  =  ^{AoTy~^ B^T .  All  matrices  in 

(2)  and  (3)  have  compatible  terminologies  and  dimensions  as  their  counterparts  in  (1). 
It  is  desired  to  find  the  pairs  (Go,^o)  ^<1  (AG,AJ7)  in  (3)  from  the  pairs  (Ao,Bo) 
and  (AAyAB)  in  (1)  such  that  the  discretized  state  x^(ibT)  in  (2a)  closely  matches  the 
state  Xe(0  ^  for  A  given  piecewise-constant  input  u^(t)  =  Ue{kT)  for 

kT<i<{k  +  1)T. 

Ezzine  and  Johnson[4]  have  shown  that  when  the  perturbation  parameter  |  A»{|  in  (Ic) 
is  suffidently  small  and  AB  =  0,  then 


where 


G  =  Go  +  A€1+0{AI^  S  Go  -h  A6 

(4tt) 

^  =  JSTo  +  Ail  +  0( A^i)  S  l7o  +  Ail 

(46) 

fT 

AG  =  Go  y  AAe^'”’  dr 

(4c) 

=  j  (^j  e^°^*-^'>AAe^‘>'^dT^Bodt. 

(4d) 

Note  that  0(6)  denotes  high-order  terms  in  c,  which  can  be  neglected  if  e  is  sufficiently 
small.  The  determination  of  the  exact  AG  in  (4c)  and  AH  in  (4d)  is  not  simple  [5]  because 
^»(~Ao)  -t-  Aj(Ao)  =  0,  where  Aj(Ao)  are  the  characteristic  eigenvalues  of  Ao. 

Shieh  et  al.  [6]  have  shown  that  the  uncertain  system  matrices  are 

AGi  =  \{Go-  /„)Ao-'AA(Go  -h  /„) 


AHi,  =  (Go  -  In)Ao^AB  +  ^{Go-  In)A-^AAHo 
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(5a) 

(56) 


and  the  counterparts  of  AG  (=  AG^  in  (5a))  and  AH  (=  AH^  in  (5b)),  defined  as  AA^ 
and  ABk,  respectively,  can  be  solved  from  respective  (5a)  and  (5b)  as 

AA»  =  2^(G,-/.)-'AC{eo +  /,)-■  (Sc) 

AB,  =  Xo(Go  -  I,)-'[AB  -  i(Go  -  /„)A.-‘AA,ffo)-  (5<0 

In  their  approach  [6],  the  time-domain  bilinear  transformation  with  the  form 

=  (/»  -  \XT)-\I^  +  ^XT) 

and  the  inverse-bilinear  transformation  with  the  form 

(/„  -  \XT)-\I^  +  ixT)  “  c-*’’ 

'  where  X  6  were  simultaneously  utilized  to  derive  (5). 


3.  The  Geometric  and  Inverse-geometric  Series  Approximation  Method 

In  this  paper,  we  use  a  scaling  and  squaring  geometric  series  method  to  linearize  the 
respective  G  and  H  in  (2).  The  geometric  series  method  is  described  as  follows. 

The  matrix-valued  function  of  with  X  G  and  a  sampling  period  T  can  be 

approximated  by  a  geometric  series  [7]  as 

G  =  c-'^  = 

=  (Q7'Pir  =  Gi  for  J  =  (6a) 


where 


Qi  = 

In  -  5r4— 

L  2(j)(m) 

Pi  = 

In  +  574-.  Jfr 
2(y)(m) 

T  <  (2)(i)(m)/l|J^||. 


if  (2>)a)(''!)(n.)<^'^^>’ 


(6S) 

(6c) 

(6d) 
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When  j  =  1,2  and  3,  we  have 


-|K  2*. 


Gi 

Ga 


(XT) 


-1 


2n.<^^)+X6™^ 


for  T< 


2m 

m 


(JfT)>j  I 


(7«) 

(Tt) 


-1 


The  relationship  between  the  sampling  period  T  (defined  as  Ti)  in  (7a)  and  the  sampling 
period  T  (defined  as  Ta)  in  (7b)  can  be  determined  by  solving  the  following  equation: 

=  { h  ”  [* + } 


where  A,  is  the  absolute  value  of  the  largest  eigenvalue  of  X.  Thus, 


(Ti) 


From  (7d)  we  observe  that  a  reUtivdy  larger  sampfing  period  T  can  be  used  tog  the 
apprordmatimi  ol  if  a  more  sophisticated  appcoad— ale  modd  is  vUfised. 

It  might  be  interesting  to  see  the  role  of  the  scaling  and  squaring  fiictor  m  in  the 
apprordmation  of  in  (6).  When  m  =  1,  the  matrix  G]  in  (7a)  can  be  represented  by 
a  geometric  series  as  follows: 

Gi=(/.-ijrr)-'(/.  +  5-«r)  for  T<||]j 

=  U  +  XT+  1( JTT)’  +  :^^{XT)’  +  ’  <*“) 

It  is  well-known  [3]  that  the  matrix  Gj  is  the  approximate  model  of  obtained  by  using 
the  Tustin  method  or  bilinear  transform  method.  When  m  =  2,  (7a)  becomes 

2(2) 


for  T< 


ll^ll 


=  /.  +  jrr  +  l(XTy  + 
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If  m  =  4,  (7a)  becomes 


for  T< 


=  I.  +  XT  +  i(Jf  T)-  +  r)*  +  +  •  •  •  (8c) 

The  exact  Taylor  series  expansion  of  is 
G  =  c^^ 

=  I,  +  XT  +  liXTY  +  T)>  +  r)‘  +  + . . .  (9) 

Comparing  (8)  and  (9)  we  observe  that  the  first  three  terms  in  all  equations  are  identical 
and  each  of  the  weighting  factors  for  the  other  terms  {XTy  in  (8)  approaches  the  corre¬ 
sponding  terms  in  (9)  as  the  value  of  m  increases.  Therefore,  (8c)  is  a  better  approximate 
model  than  those  in  (8a)  and  (8b). 

The  inverse-geometric  series  approximation  of  can  be  represented  as  follows. 
From  (7a)  we  have  the  inverse-geometric  series  approximations  as 


(lOtt) 


1  r  1  r  1  1  1  XT  '1  ”* 

V  nr  1  ^  r  yht*  l  ^ 

=  (G\I’^  -  Kt  —  )-'  “  (G''” (106) 

m  J  [  m 


TTl  ^  Tn  “  — 


(lOc) 


The  inverse-geometric  series  approximations  in  (10)  can  be  justified  by  the  same  reasoning 
ais  the  geometric  series  approximation  in  (7a).  Note  that  when  m  =  1  in  (10),  the  inverse- 
geometric  series  approximations  become  the  inverse-bilinear  approximations. 

When  A”  =  Ao  +  AA,  the  sampling  period  T  for  the  sufficient  condition  ||  AT||/27n  <  1 
in  (7a)  can  be  derived  as  follows: 


Since 


II jflir  .  Me  +  AAiir  ^  (iMdII  +  ||ax||)t  ,  ^ 

2m  2m  2m 


(lla) 


uencc. 


2m 


M.II  +  I|aa||- 

Also,  &om  (7b)  we  bave  the  inverse-geometric  series  approximations  as 


(JU) 


=  G,  a  e  =  «-»»■ 


>•1 


(12.) 


=  [(Gi^”  -  /.)(^)->|”  a  |(G>/"  -  /,)(:^)-iJ"  (IJ,) 

*[/.-^(jrT)+^(xr)>]’e.  («.) 


In  this  p^cr,  we  cany  out  the  model  conrerBoiu  by  coacentnting  on  the  bi|^i*o«dcr 
biBneay  appyoodmate  model  in  (7a)  with  m  s  2  which  is  identical  to  the  appeoadmate 
model  in  (7b)  with  m  a  1.  Then  we  compan  the  <d>tained  modd  with  the  existing  modd 
in  (5),  which  was  obtained  by  a  low-order  bilinear  approximation  method. 

4.  Model  Conversions  Using  a  High-Order  Bilinear  Approximation  Method 
Substituting  X  =  Aq  +  AA  into  (6)  and  (7a)  having  m  =  2  gives 


~r 

Hence,  we  obtain 

g;/’  -  /,  = 


(Xo  +  AXjT 


-1 


/n  + 


(Ao  +  AA)r 


for  T< 


IMoll  +  IIAAII 


(13a) 


(Ao  +  AA)r 


)-('- 


(Ao-hAA)ry 
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=  I  f/,  _  (itct-AX)  for  r< 


llit.ll  +  IIAXII 

«;/. ,  .  (iL±^]  -  ((/.  .  +  (/.  -  ih±^)] 

for  T< 


=  2 


,  (X.  +  Ait)Tl 

’  « 


l|i<.ll  +  ll^itg 


(u‘) 


and 


(c!'*  - +  /.) 


=  T 


(A.  +  A^). 


(13c) 


In  the  same  manner,  we  substitute  X  =  into  (6)  and  (7a)  with  m  =  2  jrields 

r..  AoT._i  .  .  4 


-  /n  =  ^{In  -  ^)-‘Ao  for  T  < 
AoT,.,  .  _  4 


Moll 


Gj/*  +  /„^2(/„--^)-*  for  T< 
Go-In  =  T{In  -  ^)-^Ao 


Moll 


(14a) 

(14fc) 

(14c) 


where  Go  =  Also,  Substituting  X  =  Aq,  m  =  2  and  G  =  =  Go  in  (10c)  results 

in 

A  T  AT...  A 

(14d) 


+  foe  2-<^. 


The  approximate  model  in  (13c)  can  be  further  analyzed  as  follows. 


G  -  i„  ^  G,  -  /„ 


=  Kn- 


(Ao  +  AA)T] 


-2 


4 

J 

AoT 

A  AT] 

4 

4 

AoT  1 

(AoT)> 

(Ao  + AA)r 


In- 


AoT  AATl 


]}-<i 


0  +  AA)r 


_  rAAT 


L  2  + 


+  0(7’^AA^)|  (Ao  +  AA)r 


S(W^o"*-Fo)-*(Ao  +  AA)r 


(15a) 
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where 


! 

Wo 


A 


AqT  (ApTy 
2  16 


Vo  =  \aAT  -  f  AAAo) 


(15fc) 

(15c) 


Note  that  the  nonlinear  uncertain  term  0(T^AA^)  in  (15a)  has  heen  neglected.  Hence, 


6-1,3  W,(I,  -  V.tV,)-‘(,lo  +  AA)T 

=  Wo  [/„  +  VoWo  +  O  ((VolVo)*)]  (Ao  +  Ad4)r 

^Wo{In  +  VoWo)iAo  +  AA)T  for  l|VoTVo||  <  1.  (ISrf) 


Note  that  the  nonlinear  uncertain  term  O  ((VqWo)*)  =  0(T*AA*)  in  (15d)  is  neglected. 
The  sampling  period  T  satisfies  the  convergent  condition  ||VolVo||  <  1  in  (15d)  as  shown 
below: 

Since  ||  VolVo||  <  ||  Vo||||PVo||>  can  use  the  relationship  in  (12b)  with  X  =  Ap,  m  =  l  and 
G  =  Go  to  find  lIlVoll,  i.e., 


IIW.II  “  ||{C.  -  /„)(A,T)->||  =  II  f;  i(A.r)‘->||  <  f;  illAsrii-' 

»sl 

l|A.r||  M.rr  p,r||»  ,  M.rr  ,  p.r||»  . 

2!  3!  4!  (5)(4!)  (6)(5)(4!)  ' 


Also, 


Therefore 


.  ,  ,  ll^.r||  ,  ||A.T||>  ,  M.T||> 

<  ‘  +  "IT"  +  “3!“  +  “4!” 

=  1  +  IM!!  + 


4  4‘‘ 


2! 


3! 


4! 


1  Uon 

^  4 


1  +  MoTJI  Jl^oTf 
^  4  ~  24 

1  _  MoJ’ll 

^  4 


II  Ao 


IIVoll  <  ^||AA||  +  ^(2||4o||||A,4||)  =  |||AA|1  +  ^||Ao|||| AA||. 


,  ^Mon|AA||  +  ^||A.n|AA|l  +  E||A„||||AA||  +  2||AA||  , 
"  ^  1-2||A.||  ^ 


ST’  I 
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i.e. 


rp4  I^rpi  rp2  rp 

^llAoiniA^II  +  ^Poll’llA^II  +  :L|1Ao||||AA||  +  i(||Ao!|  +  2||AA||)  -  1  <  0  (15e) 

Solving  the  inequality  in  (15e)  for  T  results  in  the  sufRcient  condition  for  the  sampling 
period  T  such  that  \\VoWo\\  <  1  and  (/„  -  VqWo)-^  =  /„  +  VqWo  +  0((VoW^o)^).  When 
the  sampling  period  T  in  (15e)  is  sufficiently  small,  (15e)  can  be  reduced  to 

^(||A„||  +  2||A^||)<1.  (15/) 


As  a  result, 


^  I1A.II  +  2||AA||  IIAoll  +  1|AA||  ^  pj ' 

The  results  in  (15e)  and  (15g)  show  that  when  the  sampling  period  T  satisfies  the  inequal¬ 
ities  in  (15e)  and  (15g),  then 


(/„  -  FoWo)-'  =  Ir,  +  VoWo  +  0{T^AA^)  ^  +  VoWo.  {15h) 


Now  we  are  ready  to  find  the  discrete-time  uncertainties  (AG,  AH)  in  (3)  from  the 
continuous-time  uncertainties  ( AA,  AB)  in  (1)  by  neglecting  any  nonlinear  uncertain  terms 
O(r^AA^)  and  0(T^AAAB)  in  the  following  manner. 

From  (13a)  and  (14)  we  have 


G^Gi  = 


/n--(Ao  + AA)T 


{ 
I 

I 

{g; 
{ol 


In-^{Ao  +  AA)T 


-1  r 


{In  +  lAoT)  +  ^AAT 


r 


T-1 


-f  AA)r 


(/n-iAoDGj/'+iAAT  I 

(/„  -  i( Ao  +  AA)r)  +  i AAT(Gy'  +  /„)  | 


+ 


7„--(Ao-hAA)r 


aa(g;''"+/,)| 

1_1  N  2 

AA(Gy’ + /„)  I 
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=  |gJ'*  +  ^ AA(/.  -  \a>T)-'  +  0(r>AX»)lA^(G;'’  +  /.)  j 

“  +  7(^»  -  j  Aor)->AA(G;'*  +  /.) 

+(|)’(^»  -  j  A,T)-»AX(/.  -  iXoT)->AA(G;'’  + 1  A 

=  |g;'"  +  ^(/,  -  1  A,r)->AA(G;'’  +  /.)  +  0(T>AX»)  V 

“  -  i^oT)-'AX(G;'’  +  /„)| 

=  G,  +  ^Gj'*(J.  -  Ii4,r)-'AA(G;'’  +  /.) 

+  !(/,  -  i  A.T)->AA(Gj'’  +  U)Gl>^  +  0(T*AA») 

4  4 

S!  G.  +  iG;'’(G;'’  -  /„)Aj‘AX(Gj''  +  /.)  +  -  J„)A.-'AA(G;'’  +  J.jGj'’ 

=  C,  +  AG,  (16a) 

where 

Go  =  (161) 

G','^  =  (16c) 

AG,  =  iG;/’(G;/’  -  l.)A^ ’  AA(G;^  +  /.) 

+  1(gJ/’  -  /„)AJ’AA(g;''  +  J„)gJ/’.  (16J) 

In  order  to  derive  the  relationship  between  the  obtained  uncertain  system  matrix  AGj  in 
(16d)  and  the  uncertain  system  matrix  [6]  as  shown  in  (5a),  we  modify  the  representation 
in  (16d)  as 

AG,  =  i(Gy’  +  /„)(Gj'’  -  /„)Ao-*AA(G»  +  /,  -  G„  +  GJ/’) 

-  \(0T  -  /»)A„->  AA(G;'’  +  /,)  +  i(cy’  -  /.)A,->AA(Gy’  +  I^)g\I' 

=  AG,  +  i(G;'’  -  J„)A„-*AA(Go  -  /„) 

-  5(0,  -  /„)A,->AA(Gy’  -  U)G\I'  (17a) 
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where  the  uncertainty 


AC,  =  i(G,-/.)VA^(Go  +  /.)  (IW) 

is  the  discrete-time  bilinear  and  inverse-bilinear  uncertun  system  matrix  [6]  as  shown  in 
(5a).  Also, 

fT 

H=  e<^»+^^)‘Bd<  =  (G-/n)(Ao  +  AA)-*(So  +  AS).  (18o) 

Jo 

Utilizing  the  result  in  (15d)  gives 

H^Wo{In  +  VoWo)iBo-\-AB)T  for  ||VoW^o||  <  1 
=  WoBoT  +  WoVoWoBoT  +  WoABT  +  W^o^oW^oABT 
=  Ho  +  WoVoWoBoT  +  WqABT  -|-  0{T^AAAB) 

—  Hq  +  AHi  (1^^) 

where 
Go  = 

r  1  1  1 

/n--Aor  +  -i(Aor)2  ^{Go-In)iAoT)-^ 

Z  .  LO  J 

Vo  =  -  AAoT  —  —  (AoAA  +  AAAo) 

Ho  =  {Go  -  Ir.)Ao'Bo  ^  WoBoT 
AFi  =  WoVoWoBoT  +  WoABT 
=  (Go-/„)Ao-^  ^AA- 

=  AHh  —  ~  ^n)-^o  *(AoAA  +  AAAo)^o 

where  the  uncertainty 

AHf,  =  (Go  -  In)Ao^AB  +  ^(^o  -  Ir,)A^^AAHo  (18d) 


—  (AoAA  +  AAAo)  Ho  +  {Go  -  U)A-^ AB 

lb 


is  the  discrete-time  bilinear  and  inverse-bilinear  uncertain  input  matrix  [6]  as  shown  in 


The  counterpart  of  AG  (i.e.  AA)  can  be  determined  from  (16d  )  by  solving  the 
following  Lyapunov  equation  [5]: 


g;"AX  +  AAGj'’  =  2X„(Gy'  -  /.)-‘AG(Gi'’  +  /.)-■ 


(19.) 


where 


^0  =  y  M^o)- 


(199) 


A  computation  method  is  available  in  [2]  for  computing  the  matrix  logarithm  function  in 
(19b). 

The  counterpart  of  AH  (i.e.  AB)  can  be  directly  obtained  from  (18c)  as 


where 


AS  =  Ao(Go-/n)-*|AJy-(f?o-/n)V 


--(AoAA  +  AAAo) 


(20a) 


A.(i  =  —  In(Go) 

Bq  =  Ao(Go  —  In)~^HQ. 


{20b) 

(20c) 


For  the  comparison  of  the  aforementioned  models  in  (5),  (16),  and  (18)  with  the 
commonly  used  Taylor-series  approximate  model,  we  derive  the  Taylor-series  approximate 
model  as  follows: 

G  =  ^  +  {Ao  +  AA)r+  ^(A®  +  AA)’T’ 

=  (/.  +  AoT  +  i  Ajr’)  +  AA(r/,  +  ^Ao)  +  (^A.)AA  +  0(r”AA") 

—  Go  +  AG|  (^1^) 


where 


Go  = 

AG,  =  AA(r/„  +  ^  Ao)  +  (^Ao)AA. 


es9 


(2U) 

(21c) 


The  corresponding  discrete-time  input  matrix  becomes 


=  (G  -  /n)(Ao  +  AA)'HBo  +  AB) 

=  r(/n  +  ^  Aor)Bo  -I-  ^  AABo  -1-  (T/„  -f  y  Ao)AB 
=  Bfl  +  ABt 


(22a) 


where 


Bo  =  (Go  -  /n)Ao  *Bo 

AB*  =  AA(^Bo)  +  (T/„  -h  ^  Ao)AB. 


(226) 

(22c) 


Also,  the  counterpart  of  AG(  (i.e.  AA^)  can  be  determined  from  (21c)  by  solving  the 
following  Lyaptmov  equation: 


AAt{TIn  +  —  Ao)  +  {—Ao)AAt  =  AG 


(23a) 


where  Ao  =  ^ln(Go).  The  counterpart  of  AB(  (i.e.  AB^)  can  be  directly  solved  from 
(22c)  as 

=  +  (23i) 

where  Bq  =  Ao(Go  -  /„)~^Bo. 


5.  Illustrative  Example 

The  unstable  dynamics  of  a  helicopter  in  a  vertical  pleine  for  an  airspeed  range  of 
60  ~  170  knots  are  given  in  [8,  9].  The  nominal  and  uncertain  system  matrices  are 


Ao  — 


-0.0366  0.0271  0.0188  -0.4555' 

0.0482  -1.010  0.0024  -4.0208 

0.1002  0.2855  -0.707  1.3229  ’ 

0  0  1  0  . 


Bo  = 


0.4422  0.1761  ' 

3.0447  -7.5922 
-5.52  4.99 

.  0  0  . 


AA  = 


■0  0  0  0  ' 

0  0  0  0 

0  A„a  0  A,J  ’ 

.0  0  0  0  . 


AB  = 


0  0 

Ail  0 

0  0 

.  0  0 


where  |A.i|  <  0.2192,  jA.al  <  1.2031  aud  |An|  <  2.0673. 
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The  above  perturbation  parameters  can  be  normalised  as  A«i  =  A^a  =  Asi  =  Aj^  = 
±1.  Thus  AA  =  Aj^Ai  +  Aj^Aa  and  AjB  =  Aj^Bj  +  Aj^Ba,  where  =  0^x2 


Ai  = 


0  0  0  0 

0  0  0  0 

0  0.2192  0  0 

0  0  0  0 


Aa  = 


Let  the  sampling  period 


T  =  0.2  < 


0  0  0  0 

0  0  0  0 

0  0  0  1.2031 

0  0  0  0 


IMsIl  +  IIAAir 


B,  = 


0  0 

2.0673  0 

0  0 

0  0 


The  exact  uncertain  system  matrix  of  the  discrete-time  model  for  jA^I  =  1  =  e  are 

'  0.00000  -0.00005  -0.00001  -0.00028 

AC  =  _ -AoT  _  .  -0.00000  -0.00103  -0.00028  -0.00571 

e  e  QQQQ34  0.03796  0.02114  0.21400 

.  0.00002  0.00397  0.00145  0.02216 


A^  =  -  U){Ao  +  cAi  +  eAa)-*(S«  +  cB,)  -  (e^^  -  /4)Ao  >5, 

0.00104  0.00001 

0.37403  0.00034 

~  ^  0.02285  -0.J::::36 
.0.00170  -0.00168. 

The  uncertain  matrices  (ACrj,  AB}),  obtained  by  using  the  high-order  bilinear  and 
inverse-bilinear  approximation  method  with  m  =  2,  can  be  computed  from  (16d)  and  (18c), 
respectiydy,  as 


A^i  =6 


0.00000  -0.00005  -0.00001  -0.00027 

-0.00001  -0.00101  -0.00034  -0.00566 

0.00035  0.03774  0.02075  0.21269 

.  0.00003  0.00393  0.00159  0.02201 


AJ7,  =e 


0.00104  0.00001 

0.37416  0.00038 

0.01365  -0.02166 
0.00103  -0.00172 


The  associated  errors  are  ||ACr  —  AGi||/||A(7||  =  0.00633,  and  ||AJ7  —  AJ7i||/||AJ7||  = 
0.02484. 

Also,  the  uncertain  matrices  (AG^,  AB^),  obtained  by  using  the  low-order  bilinear 
and  inverse-bilinear  approximation  method  [6],  can  be  computed  &om  (5a)  and  (5b),  re¬ 
spectively,  as 


AGk  =  e 


'  0.00000 

-0.00005 

-0.00002 

-0.00027- 

-0.00104 

0.00002  - 

-0.00001 

-0.00099 

-0.00052 

-0.00556 

,  ABfc  =  c 

0.37424 

0.00048 

0.00039 

0.03801 

0.01980 

0.21384 

0.01067 

-0.01831 

.  0.00004 

0.00387 

0.00202 

0.02180  . 

.0.00073 

-0.00187. 
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The  associated  errors  are  ||A(?  —  AGk||/||AC?||  =  0.00673,  and  ||A£r  —  AJ7t||/||AJ7||  = 
0.03477. 

Moreover,  the  imcertain  matrices  (AG^,  AJ7t)i  obtained  by  using  the  Taylor  series 

approximation  method,  can  be  computed  from  (21c)  and  (22c),  respectively,  as 

-0.00000  0.00008  0.00000  0.00045-]  r0.00112  0.00000  ' 

^  _  0.00000  0.00001  0.00000  0.00006  A  IT  _  0.37170  0.00000 

0.00021  0.03631  0.02407  0.20598  «  0.02515  -0.03328 

.0.00000  0.00438  0.00000  0.02406]  Lo.OOOOO  0.00000  . 

The  associated  errors  are  ||ACr  —  ACrt||/||A(?j|  =  0.04815,  and  ||AJ7  —  AJ7t||/||Air||  = 
0.02901. 

The  proposed  approximate  models  are  quite  satisfactory. 

In  order  to  demonstrate  the  sensitivity  of  the  proposed  method  to  the  sampling  pe¬ 
riod  {T  <  [[^^||^^^||)  and  the  uncertain  matrices  (Ai4,  AB),  the  comparison  of  relative 
errors  of  the  exact  matrices  {G,H)  and  any  proposed  approximate  model  i.e., 

(||G  —  G*||)/||G||  and  (||B  —  Ba||)/||jEr||,  is  presented  in  Fig.l  and  Fig.2  with  A^  =  1  and 
in  Fig. 3  and  Fig.4  with  A^  =  0.1.  It  is  observed  that  when  the  uncertainties  (AA,  AB) 
are  sufficiently  small,  the  obtained  uncertain  linear  model  using  a  high-order  geomet¬ 
ric  series  approximate  model  with  a  relatively  large  sampling  period  often  gives  a  bet¬ 
ter  result  than  those  of  lower-order  geometric  series  approximate  models  with  the  same 
sampling  period.  On  the  other  hand,  for  relatively  large  uncertainties  (AA,  AB),  the 
above  observation  may  be  not  true  due  to  the  fact  that  the  neglected  nonlinear  terms, 
0(r>AA^)  =  and  0(T=AAAB)  =  01(p;,^)^AAAB]. are  not 

sufficiently  small. 

6.  Conclusion 

A  geometric  series  method  along  with  the  inverse-geometric  series  method  has  been 
proposed  to  find  the  discrete-time  (continuous-time)  structured  uncertain  linear  models 
from  the  continuous-time  (discrete-time)  structured  uncertain  linear  systems.  This  allows 
the  use  of  well-developed  theorems  and  algorithms  in  the  discrete-time  (continuous-time) 
domain  to  solve  the  continuous-time  (discrete-time)  domain  problems  indirectly.  It  has 
been  shown  that  uncertain  system  parameters  are  propagated  into  the  uncertain  input  ma¬ 
trix.  Moreover,  from  Fig.3  and  Fig.4  it  has  been  shown  that  for  a  relatively  large  sampling 
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period  and  relatively  small  uncertrin  parameters  (e  <  l),  the  models  obtained  by  using 
a  high-order  geometric  series  approximation  method  are  much  better  than  those  obtained 
by  using  the  low-order  geometric  series  approximation  method.  A  numerical  example  is 
presented  to  illustrate  the  proposed  procedures  and  to  demonstrate  the  effectiveness  of  the 
proposed  method. 
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1  Introduction 

In  this  paper  a  new  approach  for  target  tracking  and  recog¬ 
nition  is  presented.  We  take  a  Bayesian  approach  and  de¬ 
fine  a  prior  density  on  the  scenes  of  targets  and  combine  it 
with  likelihoods  based  on  the  sensor  data  to  give  a  poste¬ 
rior  measure  on  the  parameter  space.  The  jump-diffusion 
random  sampling  algorithm  [1,  2,  3]  is  used  to  sample  from 
the  posterior. 

In  Section  2  the  basic  approach  for  solving  the  problem 
is  described.  The  global  shape  model  approach  is  described 
in  section  3  with  the  Bayesian  posterior  measure  derived 
in  section  4.  Section  5  illustrates  the  use  of  jump-diffusion 
processes  for  random  sampling  from  the  posterior  measure 
over  the  pcirameter  space.  The  last  section  describes  the 
implementation  on  a  parallel  processing  maichine  and  the 
results  obtained. 

2  Recognition  Via  Deformable 
Templates 

A  fundamental  task  in  the  representation  of  complex  dy¬ 
namically  changing  scenes  involving  rigid  targets  is  the 
construction  of  models  that  incorporate  both  the  vuiabil- 
ity  of  orientation,  r2U]ge,  euid  object  number,  as  well  as  the 
precise  rigid  structure  of  the  objects  in  a  mathematically 
precise  way.  The  global  deformable  shape  models  intro¬ 
duced  in  Grenander’s  general  pattern  theory  [4]  extended 
to  parametric  representations  of  arbitrary  unknown  model 
order  [1,  2]  are  intended  to  do  this.  This  becomes  the  basis 
for  our  approach. 

There  are  various  kinds  of  variability  and  uncertainty 
inherent  to  data  obtained  via  remote  sensing  via  high  res¬ 
olution  and  tracking  radars.  The  first  and  foremost  vari¬ 
ability  is  associated  with  the  conformations  of  the  rigid 
bodies;  orientations,  scales,  and  position.  To  accommo¬ 
date  this  type  of  variability  we  use  global  templates  which 
are  made  flexible  via  the  introduction  of  basic  transfor¬ 
mations  involving  both  rigid  motions  of  translation  and 
rotation,  as  well  as  non-rigid  motions  such  as  scale.  As 
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these  transformations  2tre  parameterized  by  positions  in 
3?^  and  orientations  in  [0,2x)^  x  [0,  x],  they  are  performed 
using  continuous  stochastic  gradient  search.  The  second 
kind  of  variability  is  associated  with  the  model  order,  or 
patrametric  dimension.  In  any  scene  there  may  be  multiple, 
variable  numbers  of  targets,  with  tracks  of  variable  lengths 
ctnd  the  target  number  unknown  apriori  and,  therefore,  to 
be  estimated  itself. 

We  tedce  a  Bayesian  approach  by  defining  a  prior  density 
on  the  scenes  of  targets.  The  prior  coupled  to  the  sensor 
data  likelihood  gives  the  Bayes  posterior.  The  conforma¬ 
tion  is  selected  to  be  consistent  with  the  data  in  the  sense 
that  scenes  of  high  probability  under  the  posterior  distri¬ 
bution  are  selected.  Our  method  for  generating  candidate 
conformations  is  to  sample  from  the  posterior.  For  this 
a  new  class  of  random  sampling  algorithms  is  used  based 
on  jump-diffusion  dynamics,  introduced  in  Grenander  and 
Miller  [1,  2,  3]  which  visit  candidate  solutions  according  to 
the  posterior  density.  The  original  motivation  for  introduc¬ 
ing  jump-diffusions  is  to  accommodate  the  very  different 
continuous  and  discrete  components  of  the  discovery  pro¬ 
cess.  Given  a  conformation  associated  with  a  target  type, 
or  group  of  targets,  the  problem  is  to  track  and  identify 
the  orientation,  translation  and  scale  parameters  accom¬ 
modating  the  V2uriability  manifest  in  the  viewing  of  each 
object  type.  For  this,  the  parameter  space  is  ssunpled  us¬ 
ing  Langevin  stochastic  differential  equations  in  which  the 
state  vector  continuously  winds  through  the  translation- 
rotation-scale  space  following  greidients  of  the  posterior. 
The  second  distinct  part  of  the  sampling  process  supports 
the  recognition  associated  with  choosing  the  target  types. 
The  deduction  algorithm  goes  through  multiple  stages  of 
hypothesis  during  which  the  airplane  types  are  being  dis¬ 
covered,  euttd  some  subset  of  the  scene  may  be  only  par¬ 
tially  “recognized.”  This  is  accommodated  by  defining  the 
second  transformation  type  which  jumps  between  different 
object  types,  where  a  jump  may  correspond  to  the  hypoth¬ 
esis  of  a  new  object  in  the  scene,  or  a  “change  of  mind" 
about  an  object  type.  The  jump  intensities  are  governed 
by  the  posterior  density,  with  the  process  visiting  confor¬ 
mations  of  higher  probability  for  longer  exponential  times, 
and  the  diffusion  equation  governing  the  dynamics  between 


jumps. 
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WV  VIS.'  till'  global  shape  models  and  pattern  theoretic  ap- 
proa.'h  introduced  by  Crenander  [5,  •!].  As  the  basic  build¬ 
ing  blocks  of  the  hypotheses  define  the  set  of  generators  Q, 
till'  targi'ts  placed  at  the  origin  of  the  reference  coordinates 
at  a  lix.'d  orientation,  position,  and  unit  sc.ale. 

Tlio  fundamental  variability  in  target  spacjis  is  jvccommo- 
dat.'d  by  applying  the  transformations  T(^),7'(p),T(s)  to 
III.'  templati's  <7  according  to 
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where  4'  is  the  triple  of  rotation  angles  associated  with  the 
rotation  of  the  viewing  sphere  about  each  object,  p  arc 
the  translation  parameters  in  9?^,  and  the  s  is  the  scale 
parameter  in  3?4..  These  parameterized  transformations 
op.'rat.'  on  the  template  targets  of  Q  generating  the  full 
larg.'t  s|)ace. 

Fignr.'  1  shows  one  of  the  ,'{-D  ideal  targets  used  for  all  of 
I  In' siiinlations  b.'low.  'I'lie  left  jianel  shows  a  rendering  of 
the  targ.'t  ;it  the  origin,  th.'  right  pan.'I  showing  the  residl. 

I  'I  .iiiiilying  one  of  I  he  I  ransfciriu.it  ions. 


I'ignre  1;  :i-l)  target  at  th.'  origin  (left  jianel)  and  after 
applying  transformations  (right  panel) 


3.1  The  Parametric  Space 

Now  the  parametric  space  parameterizing  the  Bayes  poste¬ 
rior  becomes  the  set  of  parameters  specifying  the  similarity 
transformations,  as  well  as  the  airplane  type.  Define  the 
space  containing  orientations  4  as  Ad(3)  =  [0,25r)^  x  (O.rr] 


with  the  scale  parameter  belonging  to  9^4..  Then  associ¬ 
ated  with  each  target  or  generator  €  (7  is  a  parameter 
vector  X  €  Ad(3)  X  x  x  .4,  where  (.41  =  \g\  the 
number  of  different  target  types. 

A  pattern  will  be  constructed  from  multiple  targets  with 
varying  track  lengths.  In  [6]  we  have  described  the  multi¬ 
target  scenario.  Here  we  focus  on  single  target  scenes.  We 
are  interested  in  tracking  and  recognition  in  “hostile/non¬ 
cooperative”  environments  in  which  the  objects  can  appear 
and  disappear  on  random  times  Ti,T  +  Ti  €  [0, oo)  with 
T  >  0.  The  parameter  vector  associated  with  track  be¬ 
comes 

XT  G  (A4(3)  x^+xAj  X  94  . 

As  tracks  will  be  discretized  to  sample  times  ,  <2.  •  •  •.  with 
the  object  entering  and  leaving  at  ni,n  +  rti  respectively, 
the  parameter  vector  associated  with  an  n-length  track 
is  an  element  of  .Vn  =  (A4(3)  x  9J^  x  9?^.  x  >1)"  x  Z+. 
Since  n  is  unknown,  the  full  parameter  space  becomes 

CX> 

AT  =  y  (a4(3)  X  9?=*  X  9f+  X  x  2+  .  (4) 

n=0 

The  posterior  density  defined  over  the  full  parameter 
space  X  is  assumed  to  be  of  Gibbs  form 


n{xn)  = 


Z 


(5) 


In  the  work  presented  here  only  rigid  transformations  are 
used  with  .v  =  1.  Also  we  assume  the  enterance  time  ) 
known  apriori. 


4  The  Bayes  Posterior 

I  h.'  ideals  /  G  X  ar.'  what  ran  b.' obs.'rv.'.l  by  an  id.'al  (willi 
no  loss  of  iiironnal  ion)  observer.  Th.'  art  nal  observ.-r.  liow- 
I’V'  r.  in.iy  only  In'  abh'  to  s.'e  the  i  leni.'iits  with  |o>s  of 
inlorni.'ii  )■  >n  due  to  iirojeri  ion,  nliserv’at  ion  iioi^e  and/or 
inint.'i  .'iri  nrai  v  in  tlie  sensor.  Denote  llu'  opei-.iiion  by 
wlll'  h  till'  I'leal  /  apjieai'S  as  some  ob]ei'|  s.'l\'  /■  .  bv  de 
iormaiioiis  <1  :  [  —  ,  J  g  P.  where  P  is  tlie  s.'t  of 

delormation  mi't  h.'inisms.  bot  h  random  and  deierminisi  ir. 

We  l.ake  a  Ib'iyesian  tipproach  to  the  genertil ion  of  ran- 
didate  .scenes  by  ilefining  a  ])osl('rior  probability  of  I  hi'  pa¬ 
rameter  vector  x„  representing  ideal  /  given  the  metisured 
data  according  to 

7r(x„)  =  oc  1,  ^ 

^  Yj 

where  L(/®|x„)  is  the  potential  associated  with  data  like¬ 
lihood  and  P{x„)  is  the  potential  associated  with  the  prior 
density  on  the  parameter  space. 

In  the  problem  stated  here  the  data  7®  has  multiple 
components  corresponding  to  the  various  sensors: 
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We  only  include  two  sensors,  one  for  tracking  and  one  for 
high-resolution  imaging. 


4.1  Tracking  Priors 

The  prior  on  track  formation  is  based  on  the  dyncimics  of 
target  motion  and  follows  that  described  in  Srivastava  et 
al.  [7]  in  which  the  force  equations  governing  the  motion 
of  targets  cire  utilized  to  form  a  prior  density  on  the  track 
parameter  space.  As  an  object  moves  in  3  space  it  traverses 
a  continuous  path  consisting  of  a  sequence  of  translation 
locations  p{t)  €  3^^-  For  describing  their  dynamics  we  shall 
be  interested  in  expressing  the  tracks  in  terms  of  the  bo<ly 
frame  velocities  of  the  objects,  ^(1)  G  W'*  which  are  related 
to  the  inertial  positions  [8]  according  to 


P(t)=  I 


<i>(T)r{T)(lT  +  p(7i ). 


where  ‘l>(r)  is  the  product  of  thri'e  rotation  matrices  in  Kqii 
(■_’)  .\s  in  [7]  the  rigid  body  analysis  with  the  assumptions 
that  the  earth's  curvature,  motion  ami  wind  •  tfects  are 
negligible  implies  that  the  translational  motion  is  given  by 
the  Newtoniati  vector  ecpiation 

r(i)-¥  A(0(i))v(t)  =  nn-  (<) 

Here  <s(<)  €  .'d(3)  are  the  Euler's  angles  representing  the 
orientation  of  the  target  with  respect  to  the  ground  based 
inertial  frame  and 

0  -qz(l)  q2(t)  " 

A(m)=  93(0  0  -9i(0  , 

.  -92(0  9i(0  0  , 

with  ^t)  the  rates  of  change  of  orientation,  which  are  func¬ 
tions  of  the  Euler’s  angles. 

This  linear  differential  equation  is  characterized  by  the 
time-varying  parameter  matrix  A{4){t))  and  force  vector 
/(<).  The  covariance  is  induced  following  the  approach 
of  Srivastava  et  al.  [7]  and  Amit  et  al.  [9]  by  assuming 
the  forcing  function  is  a  white  process  with  mean  /(<)  and 
a  fixed  spectral  density  <t,  which  then  induces  a  Gaussian 
process  v{t)  with  mean  v(t)  and  covariance  operator  deter¬ 
mined  by  the  differential  operator  of  Eqn.  (7)  according 
to  ^ 

=  f  e~  +  ^Ti)  (8) 

Jt, 

L-  14  \  r  A(4>(r))dT  -  /'  .4(^,(r))dT  . 

h^(t,s)  =  (T  [e  ][e  ft/fl. 

JTi 

(9) 

Using  Eqn  (6)  the  prior  density  on  the  airplane  positions 
can  be  written  as 

^'p(i,s)=f  f  ${ri)A'„(ri,7-2)$^(r2)dridr2  .  (10) 

JTi  JTi 

Notice,  this  covariance  is  parameterized  by  the  sequence 
of  airplane  orientations  G  [Ti,T2).  This  connects 

the  tracking  and  recognition  algorithms. 


4.2  The  Likelihood:  Tracking  and  Imag¬ 
ing  Data 

There  are  two  sensor  types  in  our  problem:  a  tracking 
sensor  and  a  high-resoluiton  imaging  sensor. 

4.2.1  Tracking 

For  the  tracking  we  assume  a  narrowband  tracking  cross 
array  as  in  [10,  6,  7,  11]  using  the  standard  narrowband 
signal  model  developed  in  [12].  The  uniform  cross  array 
fonsist.s  of  two  uniform  linear  arrays  orthogonal  to  each 
other,  sensitive  to  the  range.  ele\  Mon.  and  azimuth  loca- 
tion.s  of  the  targets  related  to  the  inertial  positions  ;7(t) 
through  regular  coordinate  transformations. 

For  the  data  collected  at  the  P-element  sensor  arra\  at 
time  t  the  superposition  of  tlu  incoming  signal  aiul  t!i. 
ambient  noise  becomes 

d(,-,ict(0  =  d(p(0)*’(0  +  n(t).  (11) 

where  11(1)  is  ii  P  X  1,  O-meaii  complex  gaussian  random 
vector  with  identity  covariance.  s(/)  is  the  sigiuil  vahu'  and 
tl(p(/))  is  a  regular  F  x  1  vandermotuie  direction  vt'cior 
with  the  tingles  of  signal  arrival  parameterized  by  the  iner- 
tial  postilon  p(l).  The  dt  tt  rintiiistir  signal  niodd  is  iiseil 
as  in  [10]  in  which  the  mea.snrements  y{t)  are  Gaussian 
distributed  with  mean  d(p{t))s(t). 

4.2.2  Imaging 

While  we  are  currently  incorporating  models  for  high- 
resolution  radar  imaging  as  described  in  [13,  14,  15,  16, 
17,  18],  all  of  the  results  shown  are  based  on  optical  imag¬ 
ing  systems  in  which  the  data  is  tissumed  to  be  on  a  dis¬ 
crete  square  64  x  64  lattice  C.  The  imaging  data  at  time 
t  is  the  set  of  4096  grey  scale  pixel  values  drrcogit)  = 
{di{t),i  G  C].  The  deformation  of  the  imaging  process 
of  the  ideal  targets  is  assumed  here  to  be  projection  with 
additive  noise.  For  the  simulations  shown  below  a  Gau.s- 
sian  noise  model  was  used,  with  the  measured  data  having 
mean  the  true  ideal  3-D  image  projected  onto  the  2-D  lat¬ 
tice  space. 

Shown  in  Figure  2  are  the  two  kinds  of  data  which  the 
algorithms  are  based  on.  The  left  column  shows  the  ideal 
projected  onto  2-D  t>dth  additive  noise  at  three  different 
time  instants.  This  is  representative  of  the  optical  imag¬ 
ing  component  of  the  algorithm.  The  right  column  show; 
the  spatial  power  spectrum  from  the  narrowband  tracker 
at  three  instants  of  time,  plotted  in  the  azimuth-elevation 
plane  (bright  is  low  power,  dark  is  high  power). 

The  two  measurements  become  /f*  =  {dtracit(0> ^  € 
[0,oo)},  If  =  {dr«o,(t),<  G  [0,oo)}. 

5  Jump-DifFusion  Random  Sam¬ 
pling  Algorithm 

The  most  crucial  part  of  the  problem  still  remaining  is 
the  derivation  of  the  inference  algorithm  for  generating 
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I'igiiie  2:  The  loft  oolunin  sliows  the  target  projected  onto 
the  2-D  lattice  with  adilitive  noirie  at  three  difforeiit  time 
instants.  The  right  column  shows  the  azimuth-elevation 
signal  power  profile  at  three  difft  rent  instants  of  time  as 
generated  from  the  narrowhand  t r.-n-king  data. 


inlerences  of  high  prol.altilii>.  (^ur  appro.a  di  i>  to  <  li¬ 
st  met  a  jump-diffusion  .Mark.n  pr.  jn  .'.-s,  f>ll  v,  ing  t  h<'  ap- 
jiroaeh  outlined  in  Clrenand-  r  .and  Mdlei  [l].  which  has 
the  limiting  property  that  it  converges  in  distriluiti<.iii  to 
the  Hayes  posterior,  d  his  jimip-ditfusion  Markov  process 
>  0}  samjiles  the  pe.,-,terior  density  g  ,1' 

di.Tined  over  the  full  parameter  space  .V  =  Uj^_o,V„,  i.e. 
the  time  samples  of  the  process  visit  the  conformations  ac¬ 
cording  to  the  posterior  density.  This  result  is  presented 
in  [l,  2,  3]  as  theorems  which  follow  from  two  fundamental 
results.  First  it  is  shown  that  /r(x„)  is  anary  mea¬ 

sure  of  the  process  {A'(<).<  >  0},  seci  t  iroof  that 
it  is  the  unique  stationary  measure  is  prese  ted.  The  first 
part  is  proven  by  showing  that  the  backward  kolmogoroff 
operator  A  associated  with  the  Markov  process  satisfies 
the  condition  Af{xn)n{dxn)  =  0,  for  /(■)  €  domain(i4) 
and  fi{dxn)  =  ir{x„)m{dxn),  the  distribution  correspond¬ 
ing  to  TT{xn)  with  m{  )  being  the  lebesgue  measure  associ¬ 
ated  with  the  parameter  space  X„.  The  generator  A  has 
two  parts,  A  =  A^  +  A'^,  corresponding  to  the  jump  and 
diffusion  terms. 


5.1  Jump  Process 


The  jump  process  travels  through  the  infinite  number  of 
subspaces  carrying  the  inference  from  subspace  to  sub¬ 
space.  The  two  kinds  of  jump  moves  allowed  here  are 
addition  or  deletion  of  tr2tck  segments  from  the  track  con¬ 
figuration.  These  jump  moves  are  performed  based  on  the 
intensity  parameters  derived  from  relative  posterior  ener¬ 
gies  of  the  configurations.  For  being  the  current  con¬ 
figuration  and  ym  a  possible  candidate,  Xn,ym  €  <T,  define 
q{xn,  dym)  zis  the  transition  intensity,  q(xn)  as  the  intensity 
of  jumping  out  of  the  spau:e  containing  F„,  and  Q(x„,dy,n) 
as  the  probability  of  transition.  These  are  related  accord¬ 
ing  to  the  relations 


q{x 


”>  =  / 

JT\i, 


qixn,dym) 


and 


Qi’^m  dj/ni)  — 


q{x„,dym) 

Qi^n) 


where  T*(ir„)  is  the  set  of  all  configurations  reachable  in 
one  jump  move  from  As  shown  in  [2]  there  are  .at 
least  two  ways  of  generating  these  jump  intensities  from 
the  posterior  density,  these  being  analogues  of  Gibb’s  sam¬ 
pling  and  Metropolis  [19]  based  acceptance-rejection.  The 
implementation  presented  here  uses  the  latter  according  to 
which  the  jump  intensity  is  defined  as 

q(x„,dy,n)  =  (12) 

where  [/(•)]+  stands  for  the  positive  part  of  the  function. 
The  backwanl  kolmogoroff  operator  A^  for  this  jump  pro¬ 
cess  is  given  by 


A^flxn)  =  + 


L. 


)Q(  '  ^l/rn  )/(  y-ni ) 


and  '  /{T„)/'(f/-r„)  =  0.  This  makes  Tr(x„j 
for  thi'  jiimp  part  of  the  process. 


stationary 


5.2  Diffusion  Process 


Hrtw.-en  jump  t raii.sil  ions  the  diffusion  process  searches 
through  the  imroimtahle  set  of  parameters  within  each  of 
tlie  siibspares  ,V„.  ft  is  a  sample  path  continuous  process 
which  essentially  performs  a  randomized  gradient  descent 
over  the  posterior  potential  E(xn)  associated  with  parame¬ 
ter  space  .V„  according  to  Langevin’s  stochastic  differential 
equation  (.SDE), 

dX{t)  =  VE(X{t))dt  -t-  V2dW{t)  (13) 


where  fF(  <)  is  the  standard  vector  wiener  process  of  dimen¬ 
sion  of  the  parameter  space  .  The  backward  kolmogoroff 
operator  A^  defining  the  diffusion  generated  by  above  SDE 
is  given  by 


>l‘'/(x„)  =  V£;(i„).V/(£„)  -I-  Y, 

i.j 


d{Xn)id{Xn)i 


and  satisfies  the  condition  A'^ f(x„)n{dxn)  =  0.  See  [1] 

for  the  proof. 
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6  Results 


The  tracking  and  recognition  algorithms  were  jointly  im¬ 
plemented  using  a  Silicon  Graphics  workstation  for  data 
generation  and  visusdization,  emd  a  massively  parallel  4096 
processor  SIMD  DECmpp  machine  for  implementing  the 
tracking-recognition  random  sampling  algorithm. 


Figure  3;  Tracking  and  recognition  using  the  jump- 
diffusion  algorithm.  Top  left  shows  the  actual  track,  the 
other  three  panels  display  the  different  stages  of  airplane 
identification  as  well  as  position  and  orientation  estima¬ 
tion. 

Using  the  flight  simulator  on  the  Silicon  Graphics,  a  pa¬ 
rameterized  flight  path  (Figure  3  top  left  panel)  was  gen¬ 
erated  and  stored.  From  this,  tracking  data  was  simulated 
on  the  DECmpp  assuming  a  narrowband  cross-array  of 
two  32-element  uniform  linear  arrays  orthogonal  to  each 
other.  Optical  imaging  of  the  space  around  the  estimated 
target  positions  is  simulated  using  the  Silicon  Graphics  to 
generate  data,  as  shown  in  Figure  2,  which  consitutes  the 
imaging  data.  These  two  data  sets  combine  to  form  the 
posterior  density  over  the  parameter  space  conditioned  on 
the  available  data  up  to  current  time  t.  For  each  candidate 
target  type,  a  set  of  “snap-shots”  sampling  efficiently  the 
object  space,  all  the  possible  orientations  of  the  object, 
is  pre-stored.  The  position,  orientation  and  target-type 
parameters  are  simultaneously  estimated  using  the  jump- 
diffusion  algorithm  to  sample  the  posterior  density.  The 
estimation  proceeds  by  births  and  deaths  of  track  segments 
at  random  times  through  discrete  jump  moves.  These 
moves  are  performed  by  generating  candidate  configura¬ 
tions  from  the  prior  density,  using  target  dynamics  in  this 
case,  and  selectiivg  using  Metropolis  acceptance/rejection. 
A  .second  type  of  jump  move  is  allowed  to  sample  from  the 
the  object  space  A.  A  diffusion  algorithm  is  run  between 


the  jumps  for  adjusting  the  orientation  and  position  esti¬ 
mates  following  gradients  over  the  posterior  energy.  The 
estimation  utilizes  the  prior  measure  on  target  positions 
which  is  parameterized  by  the  rotational  motion  of  the 
target.  The  object  recognition  is  coupled  to  the  target 
tracking  by  use  of  orientation  estimates  in  the  prior  mea¬ 
sure.  In  Figure  3  the  top  left  panel  shows  the  actual  flight 
path  generated  via  the  flight  simulator.  The  other  three 
panels  display  the  successive  stages  of  the  tracking  and  es¬ 
timation.  The  estimated  track  is  superimposed  in  white 
on  the  actual  track. 
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1  Introduction 

In  this  paper  we  ontUne  a  rigorous  proof  of  the  connection  between  the  optimal 
sequencing  problem  for  a  two-station,  two-customer-class  queueing  network  and  the 
problem  of  control  of  a  multi-dimeiuional  diffusion  process,  obtained  as  a  heavy 
traffic  limit  of  the  queueing  problem.  Complete  proofs  are  given  in  a  forthcoming 
paper,  Martins,  Shreve  and  Soner  (1993).  We  also  describe  how  to  use  the  diffusion 
probl«n,  which  is  one  of  "singular  control”  of  a  Brownian  motion  (also  called  "reg¬ 
ulated  Brownian  motion”  by  Harrison  (1985)),  to  develop  policies  which  are  shown 
to  be  asymptotically  optimal  as  the  traffic  intensity  approaches  one  in  the  queueing 
network. 

The  diffusion  we  wish  to  control  here  has  been  given  the  name  Brownian  net¬ 
work  by  Harrison  (1988),  who  proposed  such  models  as  approximations  to  multi-class 
queueing  networks.  The  control  of  Brownian  networks  for  the  purpose  of  obtaining 
control  policies  for  queueing  networks  was  initiated  by  Wein  (1990a,  1990b,  1992) 
and  Harrison  &  Wein  (1989,  1990).  These  papers  derive  rules  for  sequencing  cns- 

*Tlua  work  was  partially  rapportcd  by  the  Army  Reaearck  Office  and  the  National  Science 
Foandatkm  through  the  Center  for  Nonluear  Analyiis. 

^The  work  of  thia  author  was  partiaDy  supported  by  the  National  Science  Foundation  under 
grant  DMS-9200801. 
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tomer  services  and  for  contitdling  inpat  to  queueing  networks.  Laws  k.  Louth  (1990) 
and  Laws  (1991)  use  Brownian  networks  to  derive  queueing  network  rooting  policies 
as  well. 

All  these  papers  are  based  on  a  heuristic  understanding,  amply  supported  by 
simulations,  of  the  connection  between  the  Brownian  network  control  problem  and 
the  original  queueing  problem.  Such  a  connection  has  been  rigorously  established 
only  in  models  with  a  un^e  customer  class,  by  Kushner  &  Ramachandran  (1988, 
1989),  Kushner  &  Martins  (1990, 1991),  and  Krichagina,  Lou  &  Sethi  (1993).  These 
papers  use  weak  convergence  methods.  The  exogenous  processes  (e.g.  arrival  and 
service  processes)  can  be  quite  general,  provided  they  have  finite  first  and  second 
moments. 

In  this  paper,  we  assume  that  the  arrival  processes  are  Poisson  and  the  service 
times  are  exponentially  distributed.  We  base  our  analysis  on  the  Hamiltoir'Jacobi- 
Bellman  (HJB)  equation,  which,  in  turn,  is  based  on  the  Markov  property.  In 
contrast  to  other  rigorous  treatments  of  convergence,  we  are  able  to  treat  a  network 
with  multiple  customer  classes.  Our  analysis  uses  the  theory  of  viscosity  solutions 
of  HJB  equations.  Viscosity  solutions  were  first  introduced  by  Crandall  &  Lions 
(1984)  and  equivalent  definitions  were  given  by  Crandall,  Evans  &  Lions  (1984). 
For  recent  developments  we  refer  the  reader  to  Crandall,  Ishii  &  Lions  (1992)  and 
Fleming  &  Soner  (1993). 

The  particular  example  chosen  for  our  study  has  been  examined  by  Harrison 
k  Wein  (1989)  and  Chen,  Yang  &  Yao  (1991).  The  former  work  derives  a  plausible 
asymptotically  nearly  optimal  sequencing  policy  for  the  queueing  network  in  one  of 
the  parameter  cases  we  stu  'v;  we  confirm  the  asymptotic  near-optimality  of  this 
policy.  The  latter  work,  which  does  not  introduce  the  Brownian  network,  solves  the 
original  queueing  problem  in  some  parameter  cases;  we  obtain  consistent  results  in 
the  case  where  comparison  of  results  is  appropriate,  and  we  obtain  an  asymptotically 
nearly  optimal  policy  in  a  parameter  case  not  solved  by  Chen,  et.  al.  (1991). 

This  paper  is  organized  os  follows.  In  Section  2  we  describe  enough  of  the 
queueing  system  problem,  including  the  heavy  traffic  assumptions,  to  enable  us  to 
summarize  our  results.  We  complete  the  problem  formulation  in  Section  3.  In 
Section  4  we  define  the  limit  of  the  value  functions  for  a  sequence  of  queueing 
systems.  Of  course,  our  goal  is  to  represent  this  limit  as  the  value  function  for  a 
diffusion  control  problem,  and  to  use  this  representation  to  construct  asymptotically 
optimal  policies  for  the  queueing  systems.  In  Section  5  we  introduce  the  associated 
controUed  Brownian  network,  and  in  Section  6  we  reduce  the  Brownian  network 
problem  to  one  of  workload  control.  Section  7  dispatches  the  easy  case  I.  Section 
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8  provides  &n  overview  of  the  h&rdn  case  II.  FNiU  rigorous  technical  analysis  of 
a  subcase  of  case  II,  which  we  call  case  II  A,  is  pven  in  our  forthcoming  paper, 
Martins,  Shreve  k  Soner  (1993). 

We  choose  only  case  II  A  for  full  treatment  because: 

i)  it  corresponds  to  the  common  situation  of  seeking  to  minimize  the  sum  of  the 
queue  lengths  when  the  service  time  at  station  one  is  independent  of  customer  class; 

ii)  a  closed-form  solution  to  the  queudng  system  problem  in  this  subcase  is  unknown; 

iii)  the  convergence  result  in  this  subcase  requires  new  methodology;  and 

iv)  the  workload  control  problem  in  this  subcase  has  a  simple  solution. 

We  believe  that  the  techniques  developed  here  can  be  extended  to  the  other 
cases  and  to  other  problems. 


2  Summary  of  results 

We  study  a  family  of  two-station  queueing  networks  with  Poisson  arrivals  and  ex¬ 
ponential  service  times.  In  the  network,  customers  of  class  1  and  2  arrive  at 
station  1  with  arrival  rates  and  A^'*^  respectively, 

1 

Claas  5 

— -Mi) - > 


and  are  served  at  respective  rates  and  Class  1  customers  then  exit  the 
system,  whereas  class  2  customers  proceed  to  station  2,  where  they  are  redesignated 
as  class  3  customers  and  served  at  rate  ij^\ 

The  cost  per  unit  time  of  holding  a  class  i  customer  is  Ci  >  0.  The  objective  is 
to  minimize 
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(2.1) 


•'0  M 


where  is  the  number  of  class  i  customers  queued  or  undergoing  service  at  time 
t,  and  a  is  a  positive  constant. 

In  order  to  minimize  this  objective,  we  may  decide  at  each  time  t  whether  to 
serve  a  class  1  or  a  class  2  customer.  Stfvice  can  be  switched  away  from  one  class 
to  the  other  and  subsequently  switched  back,  resuming  where  it  left  off.  We  may 
also  decide  to  idle  station  1,  even  though  there  are  customers  which  could  be  served. 
This  may  be  desirable  if  there  are  no  class  1  customers  and  the  cost  C3  is  high  relative 
to  ca,  so  that  we  prefer  not  to  serve  any  class  2  customers  until  a  backlog  of  class  3 
customers  has  been  reduced. 


We  want  these  networks  to  approach  heavy  traffic  conditions  as  n  — »  00.  There¬ 
fore,  we  define  numbers  by  the  formulas 


(2.2) 


xCn) 

(n) 

tH 


x(«) 

(n) 

t^2 


-1 

■ 


x(n) 

^2 

/*3 


=  1  - 


V^’ 


so  that  1  -  is  the  traffic  intensity  at  station  1  and  1  -  b^^^/y/ii  is  the  traffic 

intensity  at  station  2.  The  heavy  traffic  assumption  is  that  for  »  =  1,2,3  and  j  =  1,2, 
the  limits 


A. 


lim 


lim  6^"^ 


are  defined,  positive,  and  satisfy 


(2.3)  sup 


VH  £  I  xW  -  Ay  I +V5:  £  I  -  C.  I + E I ‘S"’ -  y 


J=1 


isl 


i=i 


<  00. 


Our  analysis  divides  naturally  into  two  main  cases,  and  the  second  case  divides 
into  four  subcases.  We  describe  our  results  in  each  case. 

CASE  I:  Cipi  ~  C3/i3  +  C3P3  ^  0. 

As  long  as  customer  class  2  is  present,  it  should  be  served.  If  all  class  2 
customers  have  been  served,  then  class  1  customers  should  be  served. 
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This  result  agrees  with  Theorem  (5.2)  of  Chen,  Yang  &  Yao  (1991).  The 
expected  cost  reduction  per  unit  of  service  effort  devoted  to  a  class  2  customer  is 
(c3  -  C3)^3,  since  service  turns  a  class  2  customer  into  a  class  3  customer.  In  Case 
I,  (c]  —  C3)/i3  dominates  Ci/ii,  the  expected  cost  reduction  per  unit  of  service  effort 
to  a  class  1  customer.  This  results  in  the  simple  fixed  priority  rule  of  serving  class 
2  customers  whenever  they  are  present. 

CASE  II:  Cl/ll  -  C2/12  +  C3/i2  >  0. 

We  further  divide  Case  II  into  four  subcases. 

CASE  II  A;  cifii  -  02^2  +  <^3/^2  >  0,  C2H2  -  C3H2  >  0,  C2/12  -  c^ni  >  0. 

Now  a  unit  of  service  applied  to  class  1  results  in  a  greater  expected  cost 
reduction  than  a  unit  of  service  to  class  2.  We  prove  the  near  asymptotic  optimality 
of  the  policy  of  serving  class  1  unless  the  number  of  class  3  customers  falls  below  a 
positive  threshold,  in  which  case  priority  is  switched  to  class  2,  so  that  station  2  is 
not  starved.  (The  notion  of  near  asymptotic  optimality  is  defined  in  (4.4)  below.) 
The  nearly  optimal  policy  depends  on  all  the  customers  present,  and  it  is  given  by, 

serve  class  1  if  7(~ 

serve  class  2  if  7(^  ^  <  0^ 

where  Q|"^(f),»  =  1,2,3,  denotes  the  number  of  class-t  customers  present  at  time  f, 
7  is  any  function  of  the  form 


7(^1,  23)  =  a(zi)a(z3)  -  6(z2), 

and  a  and  b  are  any  bounded,  nonnegative,  concave,  increasing  functions  satisfying, 

a(0)  =  6(0)  =  0,  6(00)  <  a(oo)^. 

In  Martins,  Shreve  &  Soner  (1993),  we  show  that  as  6(00)  goes  to  zero,  the  above 
policy  becomes  asymptotically  optimal.  Harrison  &  Wein’s  (1989)  model  with  ci  = 
C3  =  C3  =  1,/xi  =  fi2  =  2,/i3  =  1  falls  into  this  subcase,  and  their  proposed 
policy  is  to  serve  class  1  if  and  only  if  ^/n  ^^"^(t)  exceeds  a  positive  constant  which 
is  independent  of  n  and  the  other  queue  lenghts.  They  showed  by  simulation  that 
with  a  properly  chosen  constant,  this  policy  outperforms  the  rules  “first-in  first-out,” 
“longest  expected  remauning  processing  time,”  and  “shortest  expected  remaining 
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procesung  time.”  They  abo  found  that  its  performance  was  close  (within  about  5%) 
of  a  lower  bound  they  obtained  for  optimal  performance. 

The  heuristic  justification  of  the  policy  is  case  11  A  suggests  that  the  same  pol¬ 
icy  is  asymptotically  optimal  under  only  the  case  II  condition  ci/ii  —C3|i2+ 03/13  >  0* 
Our  proof  of  the  result  stated  in  case  II  A  suggests  otherwise.  Although  we  have  not 
worked  out  a  full  proof  for  the  other  three  subcases,  the  proof  for  case  II  A  strongly 
suggests  the  following  conjectures. 

CASE  II  B:  ci/ii  -  C3/»3  +  £3/13  >  0,C3,  /is  -  cs/is  <  0,03/13  -  c\H\  >  0. 

There  is  a  continuous,  increasing  function  ^3  :  [0, 00)  -*  [0, 00)  satisfying, 

0  <  ♦2(u;2)  <  Vws  >  0,  and  lim  92(u;3)  =  00. 

Class  1  should  be  given  priority,  unless  either  the  queue  length  of  class 
1  customers  falls  to  zero  or  the  queue  length  of  class  3  customers  falls  be¬ 
low  some  small  threshold.  While  either  of  these  conditions  is  satisfied,  priority 
should  be  switched  to  class  2,  except  that  whenever  =  0  and  <  >/” 

/ia)),  station  1  should  be  idled.  This  idleneness  cam  be  ex¬ 
plained  by  the  fact  that  it  is  cheaper  to  hold  class  2  customers  at  station  1  than  to 
send  them  on  to  be  held  as  class  3  customers  at  station  2;  note  that  in  this  subcase, 

C2  <  C3. 


CASE  II  C;ci/ii  -  C2^l2  +  ca/i?  >  0,03/13  -  ca/12  >  0,  03/13  -  oj/ij  <  0. 

There  exists  a  continuous,  increasing  function  'i'l  :  [0,oo)  — ►  [0, 00)  satisfying, 

0  < 'i’i(wi)  < /i2u;i/w3,  Vu>i  >  0,  and  lim  4’i(wi)  =  oo. 

u/1  -^00 

Class  1  should  be  given  priority  unless  either  =  0  or  is  less  tham 
a  small  treshold.  While  either  of  these  conditions  is  satisfied,  priority  should  be 
switched  to  class  2,  except  that  when  >  0,  amd 

<  y/n  /t3*i((Qi’‘Vv^ Mi)  +  Mz)), 

priority  should  be  ^ven  to  clatss  1,  even  though  this  may  cause  station  2  to  starve. 
Idling  station  2  cam  be  explained  by  the  fact  that  the  cost  of  operating  the  network 
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can  be  reduced  more  quickly  by  serving  class  1  than  by  serving  class  2;  note  that 
>  C2H2. 

CASE  II  D:  CiUi  —  C2H2  +  ^3/^2  >  0,C2^2  —  £3/12  <  0,  C2fi2  ~  £i/*i  <  0. 

This  case  is  a  combination  of  case  II  B  and  case  II  C.  We  conjecture  the 
existence  of  functions  and  $2  ^  described  above.  Idling  can  occur  at  either 
station  1  or  station  2,  as  described  in  case  II  B  and  case  II  C,  respectively. 


3  Queueing  network  problem 

For  the  queueing  network  of  the  previous  section,  for  t  =  1,2,  let  <  t  < 

00}  be  the  class  i  customer  arrival  process,  assumed  to  be  Poisson  with  intensity 

For  I  =  1,2,3,  let  <  t  <  00}  be  the  class  i  customer  service  process, 

assumed  to  be  Poisson  with  intensity  We  take  all  these  processes  to  be  left- 
continuous,  and  we  denote  by  <  t  <  c»}  the  filtration  generated  by  these 

five  processes. 

A  control  law  {T(t),  C/(t);0  <  t  <  00}  is  a  pair  of  left-continuous,  < 

t  <  oo}-adapted,  {0,  l}-valued  processes.  The  process  Y{t)  indicates  whether  sta¬ 
tion  1  is  active  (y(<)  =  1),  or  idle  {Y{t)  =  0),  and  U{t)  indicates  whether  station  1 
is  serving  customer  class  1  {U{t)  =  1)  or  customer  class  2  {U{t)  =  0).  Given  non¬ 
negative  initial  queue  lengths  Qi"^(0),Q2"\o)  and  Q3”^(0)  for  the  three  customer 
classes,  and  given  a  control  law  (T,  U),  we  define  the  queue  length  processes 


(?W(I)  =  (?i”>(0) + -  /o' 

(jW(()  =  qW(0)  +  4"'(0  -  ^  K(o)(l  - 

=  0o"’(0)  +  /o' 


We  denote  the  vector  of  queue  length  processes  by 

= («!"’(<). 
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(Note:  Because  the  inter-service  times  are  exponentially  distributed,  the  processes 

£  ttd  si'\£ 

have  the  same  law.  This  permits  us  to  write  in  terms  of  the  former,  although 

the  latter  more  nearly  reflects  the  way  we  interpret  the  system.  If  service  of  a 
customer  is  preempted  and  later  resumed,  we  assume  that  service  be^ns  where 
it  was  left  off.  After  resumption  of  service,  the  time  to  completion  has  the  same 
exponential  distribution  as  the  orimnal  distribution  of  the  service  time.  Similar 
comments  apply  to  Q^\t)  and  Qa  (<).) 

The  vector  of  scaled  queue  length  processes  is 

y/n 

This  is  a  Markov  chain  with  lattice  state  space  =  0,1,...}^,  and,  for 

fixed  (y,  u)  €  {0, 1}^,  its  infinitesimal  generator  is 


^  ^€i )  -  <p(z)]  -f-  nX!£'\<p(z  +  -  ^(<?)] 


(3.1) 


-(-  -  :^ei)  -  v5(z)]l{„>o} 

+  nM2"^y(l  -  «)b(^  -  -^^2  +  -  V’(^)ll{z,>o} 


+  -  :;^e3)  -  ¥>(^)]1{,3>o}, 

where  z  =  (zi,Z2i23),ei  =  (Ii0,0),e2  =  (0,1,0)  and  ca  =  (0,0,1).  It  particular, 
given  any  control  law  (¥(•),[/(•)),  for  any  real- valued  function  ip  on  the  process 


(3.2)  e-"V(^^”^(0)  +  /*  c-“*(avj(Z(")(s))  -  r’*-''<')'^<*)(^(Z(’*)(s))]ds 

Jo 

is  a  local  martingale. 

Using  the  positive  holding  costs  ci,  03,03,  we  define  the  holding  cost  function 
=  2Z?s:i  CiZi.  Given  an  initial  condition  Z(’*^(0)  =  z  €  and  a  control  law 
(y’(‘),  U(*)),  we  define  the  assodated  cost  function  at  z  by 
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(3.3)  r  e-°*h(Z^''\t))dt. 

*  Jo 

In  terms  of  the  original  queue  length  process,  this  cost  can  be  written  as  (c/.(2.1)) 


The  value  function  at  2  is 

(3.4)  ji"^(2)=  inf{Jyy(2);  (K,  to  *s  a  control  law). 


For  If  :  -*  H,  we  define  the  nonlinear  operator  £”•*  acting  on  f  by 

(3.5)  £”’>(0  =  min{£"'>'-V(‘);  iv, «)  €  {0, 1}^}  Vz  € 

The  Hamilton-Jacobi-Bellman  (HJB)  equation  for  the  queueing  network  is 


(3.6)  a<p  -  £"'V  -  ft  =  0  on  L<"). 

The  foUowing  is  a  standard  verification  theorem. 

Proposition  3.1  The  value  function  is  the  unique,  linearly  growing  solution 
of  the  HJB  equation  (3.6).  If  f  is  a  linearly  growing  subsolution  (respectively,  su¬ 
persolution)  of  this  equation,  then  f  <  (respectively,  f  >  Furthermore, 

any  stationary  control  law  (y*,t/*)  satisfying 

(3.7)  =  £"’Vl"> 
is  optimal. 


4  The  heavy  traffic  limit  of  the  value  function 

In  order  to  let  n  — »  oo,  we  need  an  upper  bound,  independent  of  n,  for  the  non¬ 
negative  functions  The  foUowing  estimate  is  an  easy  consequence  of  the 

maximum  principle,  see  Martins,  Shreve  &  Soner  (1993),  Theorem  6.1. 
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Proposition  4.1  There  are  corutants  Ki  and  K2,  independent  of  n,  each  that 

<Ki  +  Kiizi  +  +  Z3)  Vr  € 

We  wish  to  consider  limn-.oo  hut  since  each  is  defined  on  a  different 
set  the  definition  of  this  limit  is  not  straightforward.  Borrowing  the  technique 
developed  by  Barles  &  Perthame  (1988)  (also  see  Fleming  &  Soner  (1993,  Section 
7.3)),  we  define  the  upper  semicontinuous  limit  J*  of  by 

(4.1)  J*(z)  =  lim  lim  sup{yi’*^;  ||C  -  r||  <  c,(  €  Vz  €  [0,oo)^, 

and  the  lower  semicontinuous  limit  by 

(4.2)  J^(2)  =  Urn  inf{  \\C  -  z||  <  €,C  6  !<")}  Vz  G  [0,oo)^ 

Then  J*  is  upper  semicontinuous,  is  lower  semicontinuous,  and 

(4.3)  0  <  y#(z)  <  J*iz)  <Ki  +  Kiizi  +  Z2  +  Z3)  Vz  €  [0, 00)^ 

In  (Martins,  Shreve  &  Soner  (1993)),  we  prove  that  =  J*,  and  also  we  use 
a  Brownian  network  problem  to  suggest,  for  each  ?/  >  0,  a  sequence  of  stationary 
policies  {(T^"^  suth  that 


“  ’^#(01;  lie  -  ■^11  <  C  e  <  T/, 

for  all  z  G  [0, 00)^.  We  call  such  a  family  (parametrized  by  rj)  of  sequences  of  policies 
asymptotically  nearly  optimal. 


5  The  controlled  Brownian  network. 

We  first  introduce  the  controlled  Brownian  network  and  then  explain  by  an  analysis 
of  the  infinitesimal  generator  why  it  is  relevant.  Let  Mi, M2  and  M3  be 
continuous  martingales  relative  to  a  filtration  {.F(f )}  satisfying  the  usual  conditions 
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that  each  J^{t)  contains  all  null  sets  of  /■((»)  and  that  ^(t)  =  n,>i^(s)  for  all  t. 
Assume  that  for  all  t, 

(6-1)  (Mi)(t)  =  2Ait,  {M2){t)={M3){t)  =  2X2t, 

(5.2)  {Mi,M2)(t)  =  {MuM3)(t)  =  0,  (JWj.MaXO  = -A^t. 

Given  z  G  [0,oo)^,  we  will  say  that  the  quadruple  of  {/■(!)}  -adapted 

processes  is  admissible  for  initial  condition  z  provided: 

(i)  (to, ii, (2,(3)  are  right-continuous  with  left-hand  limits,  with  the  conven¬ 
tion  that  /,(0~)  =  0,  I  =  1,2,3; 

(ii)  io  is  of  finite  variation  on  bounded  intervals; 

(iii)  ^i,f2/3  are  nondecreasing, 

(iv)  the  state  process  Z(t)  =  (Zi(t),  Z2(t),  Zsit))  is  in  [0,cx)^  for  tdl  1  >  0, 

where 

(5.3)  2i(t)  =  2i  +  Afi(l)  4-  Pito{i)  -f  fi(<)» 

(5.4)  ^2(1)  =  ^2  ~  i>i/<2<  +  Af2(0  -  +  h{l)> 

(5.5)  2’3(<)  =  23-1-  (6i/i2  ~  b2P3)t  +  Afs  (0  +  (0  ~  ^2  (0  +  ^3(0- 


The  cost  function  associated  with  (^o^  ^1,^2*  ^3)*  admissible  at  z  €  [0, 00)^,  is 

Vw,*.<,(»)S£re-»'MZ(<))rf(. 

Jo 

The  value  function  for  the  controlled  Brownian  network  is 

(5.6)  V{z)  =  inf  {Vio/,,r,,/,(z);(4,/i,^2,^3)  is  admissible  at  z) ,  z  €  [0,oo)^. 

The  cross  variation  formulas  (5.1),  (5.2)  imply  that  the  vector  of  martingales 
(Ml,  M2,  M3)  is  nothing  more  than  a  three-dimensional  standaurd  Brownian  motion 
multiplied  by  a  non-singular  matrix,  so  this  vector  of  martingales  is  also  a  Markov 
process.  If  we  set  the  control  processes  fo,fi,/2,^3  equal  to  zero,  the  state  process 
Z{t)  given  by  (5.3)-(5.5)  is  Markov  with  infinitesimal  generator 

(5.7)  £(p  =  -bi/i2<P2  +  ibiP2  ~  b2fi3)<P3  +  -f-  X2<P22  -  ^2*P23  +  AjV’aa. 
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where  (p  is  any  real  valued,  function  on  {0,  oo)^  with  denoting  partial  derivative 
with  respect  to  the  i*^  variable. 

The  controlled  Brownian  network  is  an  intermediate  problem  between  the 
queueing  networks  studied  thus  far  and  the  workload  control  problem  of  the  next 
section.  Although  the  value  function  is  wdl-deiined  by  (5.6),  the  problem  does  not 
have  an  optimal  solution.  One  would  Uke  to  keep  the  state  Z{t)  on  a  face  of  the 
orthant  [0,oo)^,  but  this  is  not  possible  with  the  bounded  variation  control  processes 
A)f  Fortunately,  when  we  pass  to  the  workload  formulation,  we  will  obtain 

a  wdl-posed  control  problem. 

We  conclude  this  section  with  an  asymptotic  expansion  of  the  infinitesimal  gen¬ 
erator  of  (3.1)  for  the  controlled  queueing  network.  This  expansion  explains 
the  origin  of  the  Brownian  network  problem  introduced  in  this  section. 

Suppose  (fi  ;  [0,  oo)^  is  thrice  continuously  differentiable,  and  aU  derivatives  of 
<fi  up  to  order  three  are  bounded  uniformly  on  [0,oo)^.  Fix  (y,  u)  €  {0,1}^,  and 
define 


(5.8) 


0  —  y/n 


u 


,<ri  =  v/n/i<”^(l  -  y)u,(T2  =  -  y)(l  -  ti). 


Recalling  (2.2),  we  may  write 


(5.9)  +  (!-„)=  ±  +  +  = 


For  z  €  [0, 00 )^,  we  set 


(5.10)  9i(z)  -  ^v’ii(z)ll{,,=o>, 

(5.11)  ^  ^<”^(1  -  «)v^  BJv>(z) 

-  V’a  -  2^  ('^22  +  V’33  -  2933), 

(5.12)  ,^(0  - 1  v»(z)ii,„^), 

so  that  (3.1)  becomes  (see  Martins,  Shreve  &  Soner  (1993)  for  details). 
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(5.13) 


+  »ilv>«(*)-5j59n(*)l  +  »a8jV(*) 

+  EL.  "TV*)  +  0(;Jil), 

where  Cip  fe  g^vea  by  (5.7)  utd 

(5.U)  <<*»=(».S*‘.-4**.i4'**) 

(5.15)  +  2/4"^(9»  “  +  *P3»)' 

The  expressions  in  (5.14),  (5.15)  ere  bounded  vnifcsmly  in  n.  However,  9y  cr\  and  02 
are  of  order  y/n^  as  are  the  terms  The  term  Vtp  •  in  (5.13) 

agrees  with  V(p  up  to  an  error  of  order  but  this  term  cannot  immediately  be 
replaced  by  V(p  •  ^  because  $  multiplying  it  is  of  order  y/n. 

Equation  (5.13)  suggests  that  the  controlled  Brownian  motion  Z{t)  given  by 

(5.3) -(5,5)  approximates  the  scaled  queue  length  process  The 

control  variable  9  in  (5.13),  which  can  be  either  positive  or  negative,  corresponds 

to  pushing  in  approximately  the  direction  {  =  (Mit~M2iM2)  or  the  direction  In 

(5.3) -(5.5),  this  pushing  is  accomplished  by  the  locally  finite-variation  process  to. 
The  processes  /i,/}  and  /a  appearing  in  (5.3)-(5.5)  allow  us  to  enforce  the  condition 
•^(0  ^  [0*  for  all  t  >  0.  We  have  set  up  the  controlled  Brownian  network  to 
allow  to  grow  even  when  Zi{t)  >  0;  this  corresponds  to  idling  the  serving  stations. 


6  The  workload  formulation 


Following  Harrison  &  Wein  (1989),  we  introduce  the  workload  transformation 


(6.1) 


Zi  Z2  Z2  +  Z3 

tDi  =  —  -h  — ,  W2=  - , 

Ml  M2  M3 


which  maps  the  state  space  [0, 00)^  of  the  controlled  Brownian  network  onto  the  state 
space  [0, 00)^  of  the  workload  control  problem  formulated  in  this  section.  If  (zi,  23,  Z3) 
represents  the  three  queue  lengths,  then  (wijWs)  is  the  expected  impending  service 
time  for  the  two  stations  embodied  in  customers  anywhere  in  the  network.  The 
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workload  formulation  reduces  the  dimensionality  of  the  control  problem  from  three 
(the  number  of  customer  classes)  to  two  (the  number  of  stations). 

Because  we  can  use  the  contnd  process  Iq  in  (5.3)-(5.5)  to  instantaneously 

change  the  state  Z(f )  in  the  directions  =  ±(/ii ,  -/i2,  /13)  at  no  cost,  the  Brownian 
network  value  function  V  of  (5.6)  will  be  constant  along  the  direction  (.  This  means 
that  V  (21,23,23).  can  be  written  as  a  function  of  (101,103),  because  (101,103)  does 
not  change  along  the  (-direction.  It  also  means  that  one  would  want  to  keep  the 
process  Z  (t)  on  the  locus  of  points  in  [0, 00)^  which  minimize  h  along  line  segments 
parallel  to  (.  To  find  this  locus,  one  considers  for  each  (101,103)  €  [0,oo)^  the  linear 
program 


Minimize  e\Zi  C323  +  C323 
Subject  to  ^  ^ 

=  103» 

W  1*3 

^1  >0,  23  >  0,  23  >  0. 

Denote  by  h(  101, 103)  the  value  of  this  linear  program.  We  h..  e  two  major  cases: 

CASE  I;  c\Hi-C2H2  +  ^3/13  <  0. 

In  this  case, 


(6.2)  h(t0i,  103)  =  Cl/ll  101 -I- 03/13103  , 
and  the  minimizer  in  the  11 -'ear  program  is 

(6.3)  2i  =  /I1I01,  Zj  =  0,  23  =  /l3t02. 


CASE  II:  Cl/ll  -  C2H2  +  C3H2  >  0. 
Now 


f  (C2/I2  -  C3/l2)l0i  +  C3/13102  if  /13102  >  /I2W1, 


(6.4)  h(u>i,u;2)  =  < 


[  C1/I1I01  +  ^{C2H2  -  Cl/ll  )102 
The  minimizing  values  are 

(6.5)  z\  =  0, 2j  =  /I2i0i,^  =  /i3i»2  -  if 


Zi  =  — (/l2tPl  -  f»3l»2)»  33  =  M3*»2.  ^3  =  0 
M2 


if  /I3I03  < 


/I3I02  >  /i2l»l, 
if  /13102  <  /13101. 
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TIm  worUood  eoiUrol  fnhlem  has  state  aqaatioas 

(«4)  H-iW  =  <n-ht*  r  W  + 

(M)  +  +  *>«). 

wImts  Um  pab  (mi ,  ma)  of  {/’(f  ))-ad^ted  amlial  pfoeassas  b  arfmtssalfc  far  mHi 
wwrfifisw  w  m  («i,«a)  €  (0,oo^  ptoaided: 

(i)  mi,fiia  am  right-contiaaoaa  wHk  hA-kaad  Bnhs,  with  tbe  convaaticm 
that 

mi(0“)  =  0,  isl,2; 

(ii)  miyfna  aia  aoBdeciaaai^ 

(iii)  the  state  process  1V(t)  =  (H^i(0*  ^a(O)  ^  [0>oo)^  for  all  (  >  0. 

(We  have  in  ound,  of  course,  that  mi(<)  =  m2(t)  =  where  fi,/3  are  part 

of  an  admissible  quadruple  (foifitfsyfs)  for  »e  contrdkd  Brownian  network.)  The 
cost  function  associated  with  (mi,  m3)  at  w  €  [0, 00)^  is 

=«jf 

and  the  value  function  at  w  is 

(6.8)  V(w)  -  inf{Vm,.ma(*)l  (»»i. »»»)  “  admissible  at  w). 

Although  we  do  not  need  this  foct  for  our  analysis,  one  can  show  that  V  of 
(5.6)  and  V  of  (6.8)  are  related  by  the  equation 

(6.9)  V(z„  zi,  Z3)  =  <^(^  +  ^^)  V  z  €  [0, 00)3. 

Pi  Pa  P3 

If  one  had  an  optimal  (m|,m|)  for  the  workload  control  problem,  then  as  an  op¬ 
timal  policy  for  the  Brownian  network  problem,  one  would  want  to  take  fj(t)  = 
Pim;(<),  f5(t)  5  0,  Q(t)  =  P3mS(<)>  and  choose  io  to  ensure  that  Z*{t)  is  al¬ 
ways  pven  by  (6.3)  or  (6.5)  with  «;<  =  W7(0»  »  =  1*2,  depending  on  the  sign  of 
Cipi  —  C3P3 +C3P3.  However,  such  an  io  does  not  exist,  and  so  the  Brownian  network 
contrd  problem  is  ill-posed. 
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7  Solution  of  Case  I: 


This  is  the  caae  ci/ii  -  cj/ia  +  cs/ia  <  0.  Since  h  given  by  (6.2)  is  incrensing  in  each 
variable  separately,  the  optimal  contnd  processes  mi  and  m3  act  only  wh«i  Wi  =  0 
or  W2  s  0,  respectively.  More  predsdy, 

(7.1)  mi(l)  =  max  (-toi  +  hiS  -  ^Ari(s)  -  ^M3(s)]+, 

pi  Pa 

(7.2)  m3(t)  =  max  [-i»3  +  63s  — — ^^3(s)]'*^ 

0<J<«  P3  P3 

are  the  minimal  nondecreasing  processes  which  ensure  that  the  associated  state 
processes  remain  nonn^ative  almost  surely.  In  particular, 


(7.3)  m,(t)  =  l{nr,(,)_o}<^”*i(«)»0  <  t  <  c». 

One  can  actually  compute  the  value  function 

V(u;i,tD3)  s  Vm,,mj(t»l,t02) 

(7.4) 

=  A-y  niBiWi  +  '112B2W2  +  ^le"'’’*^  +  , 


where  7i  >  0, 73  >  0  solve  the  quadratic  equations 

(3  +  tI)  7?  +  fti7i  ~  1  =  0,  ^72  +  6272  -1  =  0, 

\/*l  ^2'  /*3 


and  Bi  =  cipi/71,  B2  =  C3P3/72,  A  =  --jibiBi  -  72^2^2- 


The  formula  Zj  =  0  in  (5.3)  suggests  that  customer  class  2  should  always  have 
priority,  a  fact  already  established  by  Chen,  Yang  &  Yao  (1991).  Thus,  for  the 
queueing  networks,  we  define  the  stationary  control  law  ( independent  of  n  in  this 
case) 


Y{z)  ^ 
U{z)  ^ 


{1  if  zi  >  0  or  Z2  >  0, 
0  if  zi  =  Z2  =  0, 

f  0  if  Z2  >  0, 

(  1  if  za  =  0. 


One  can  show  that  (Y,  U)  is  asymptotically  optimal  in  the  sense  of  (4.4)  with  f/  =  0. 
We  omit  the  proof,  focussing  instead  on  the  more  complicated  case  HA  below. 
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8  Discussion  of  CASE  11: 

This  it  the  cate  ciiii  -  caMa  +  cath  >  0.  In  hfaitint,  Shieve  le  Saner  (1993)  the 
coinpiete  aaalyait  it  ipven  only  for 

CASE  IIA:  cifti  -  caMa  ca/ia  ^  0,  caMa  -  ca|ia  >  0,  es|ia  -  ci/ii  >  0. 

In  thit  cate,  the  function  h  given  by  (6.4)  it  nondecreaiong  in  each  variable 
aei»aratdy.  The  optimal  control  procnitw  for  the  m^kload  problem  are  ttiU  given 
by  (7.1),  (7.2)  and  tatitfy  (7.3),  but  V  no  longer  hat  the  timple  doted  form  (7.4). 
Becante 

(8.1)  V(tD)  =  e^hiW{t))dt,  Vw  €  [0,  oo)», 

the  Feynman-Kac  formula  and  dliptic  regularity  imply  that  V  it  on  the  open 
quadrant  (0,  oo)^,  V  it  C*  on  the  doted  quadrant  [0,  oo)’,  and 

(8.2)  Vi(0, wa)  =  ^a(»i.0)  =  0  V(tci, wa)  €  [0, oo)*, 

(8.3)  ttV-  £  -  h  a  0  on  (0,  oo)*, 

where  ^  X  X  \  X  X 

£  ^  =  -bi<^  -  ha^  +  +  -y  j  +  — ^^a  + 

'Ml  M2'  M^Ms  M3 

for  any  C*  function  9 :  (0, 00)^  — ►  R. 

CASE  IIB:  ci/«i  —  ca/ia  +  cs/ta  >  0,  ca^ia  -  €3^3  <  0,  caMr  -*  >  0. 

Now  h  it  strictly  decreasing  in  wi  for  wi  €  [0,/i3tO3/^3],  which  suggests  that  v>\ 
should  not  be  allowed  to  fall  too  far  below  P3W3//13.  Numerical  experimentation  sup¬ 
ports  the  conjecture  that  there  is  a  continuous,  increasing  function  fa  :  [0, 00)  — ^ 
[0, 00)  sudi  that  the  optimal  contrd  process  mi  in  the  workload  control  problem 
acts  whenever  lFi(<)  =  93(H^3(f))  to  ensure  that  the  inequality  IVi(t)  >  93(VF3(()) 
is  always  satisfied.  The  rest  of  the  conjecture  was  set  out  in  Section  2. 

CASE  lie,  IID:  The  functions  Vi  and  V3  appearing  in  the  Section  2  conjectures 
about  these  cases  are  the  free  boundaries  on  which  reflection  should  occur  in  the 
optimal  control  of  the  workload  processes. 
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STABILITY  AND  ERROR  ESTIMATES  OF  STOCHASTIC  INTEGRO-DIFFERENTIAL 
EQUATIONS  WITH  RANDOM  PARAMETERS 
G.  S.  Ladde 

Department  of  Mathematics,  The  University  of  Texas  at  Arlington,  Arlington,  Texas  76019,  U.  S.  A. 

and 

S.  Sathananthan 

Department  of  Mathematics  and  Center  ot  Excellence  in  ISEM,  Tennessee  State  University,  Nashville, 
TN  37209,  U.S.A. 

1.  Introduction. 

Stochastic  integro-differential  equations  arise  in  reactor  dynamics,  heat  transfer,  atomic 
scattering,  population  dynamics,  and  various  other  scientific  disciplines.  The  randomness  in  the 
equation  can  be  due  to  any  possible  combination  of  (i)  random  coefficients,  (ii)  random  initial 
conditions,  (iii)  random  forcing  functions. 

In  this  paper,  utilizing  a  generalized  variation  of  constants  formula  we  have  attempted  to 
estimate  the  error  between  the  solution  of  the  stochastic  and  the  solution  of  the  deterministic 
version(mean)  of  a  random  integroKtifferential  equation.  Recently  some  attempts  have  been  made  in 
this  direction  for  random  differential  equations  [3,5],  for  random  difference  equations  [6,7],  and  ltd- 
type  stochastic  differential  and  integro-differential  equations  [8,9,10]. 

We  also  have  obtained  some  sufficient  conditions  to  guarantee  the  stability  of  solutions. 
Furthermore,  sufficient  conditions  are  given  for  error  estimates  of  solutions  relative  to  corresponding 
smooth  systems.  The  obtained  results  of  the  present  study  would  provide  a  tool  that  verifies  to  what 
extent  incorporating  randomness  in  the  system  causes  the  change  of  behavior  of  the  system  relative  to 
its  deterministic  version. 

2.  Preliminaries. 

Let  us  consider  the  stochastic  integro-differential  equations(SIDE) 

t 

y'  =  f(t.y.w)  +  I  K(t,s,y(s,w))ds  ,  y(to,u;)  =  yo(w)  (2.1) 

*0 
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and  the  cotresponding  deterministic  initial  value  problem  (DIVP) 

t 

m'  =  f(t,m)  +  I  K(t,s,m(s))ds,  m(to)  =  E(yo(w)]  (2.2) 

^0 

which  is  obtained  by  ignoring  the  random  disturbances  in  the  system  described  by  (2.1).  In  our 
subsequent  analysis  we  will  utilize  the  following  random  initial  value  problem(RlVP) 

t 

x'  =  f(t,x)  +  I  K(t,s,x(s))ds,  x(to,t.;)  =  Xq{u)  (2.3) 

*^0 

In  (2.1),  (2.2)  and  (2.3),  y,m,x€  R",  XQ,yQ€R(n,R”];  E  stands  for  the  expected  value  of  a  random 
variable;  (f2,F,P)  is  a  complete  probability  space,f€M(R^.xR",R(fi,R"]),  K€M[R^xR^xR",R(n,R")], 
and  f(t,y,w),  K(t,s,y,Ci;)  are  sample  continuous  in  y  for  fixed  t,seR'*’,  f  eC[R^xR",R")  and 
K€C[R^xR^.xR",R"). 

We  assume  that 

(Hj)  f,  K  satisfies  desired  regularity  conditions  so  that  the  initial  value  problem  (2.1)  has  a  sample 
solution  process  existing  for  t>tQ; 

(Hj)  fx,  Kx  exist  and  1^  €C(R'^xR",R"’^"],  Kx€C(R‘^xR"xR'',R"’''’]. 

The  above  conditions  implies  that  x(t)=  x(t,tQ,ZQ)  is  unique  solution  process  of  (2.2)  or  (2.3) 
depending  on  the  choice  of  Zq,  where  Zg  is  either  yg  or  Xg. 

3.  Variation  of  Constants  Method. 

In  this  section  we  present  a  generalized  variation  of  constants  method  for  stocliastic  systems  of 
integro-differential  equations  with  random  coefficients.  This  method  gives  an  integral  representation  of 
a  function  of  a  solution  process  of  (2.1)  with  respect  to  the  solution  process  of  the  random  initial  value 
problem  (2.3)  through  (tg,yg(w)). 

Theorem  3.1. [11]  Let  the  hypotheses  (Hj),(H2)  be  satisfied,  and  y(t,u;)=  y(t,tg,yg(u;),w)  and  x(t,w) 
=x(t,tg,Xg(u;))  be  the  sample  solution  processes  of  (2.1)  and  (2.3)  existing  for  t>tg  with  Xg((<;)  = 
yo(‘*’)-  Further  assume  that  Vx(t,x,w)  exists  and  is  sample  continuous  for  (t,x)€R.^xR”.  Then, 

t 

V(t,y(t,w),u;)  =  V(to,x(t,u;),w)+  |  |Vs(s,x(t,s,y(s,w)),i.;) 

t© 


+Vx(8.x(t,8,y(s,w)),w)*(t,s,y(8,u;))R(8,y(8,w),Ty(s,w),w)|d8 
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I  I 

+  I  |CVx(<r,x(t,<r,y(<r,w)),w)*(t,<r,y(<T,w))-Vx(8,x(t,s,y(8,w)),u;)L(t,s,y(s,w),w)]] 


k(o’, 8, y(8,u)))d(rd8. 


i 

where  R(t,y,Ty,w)  =  f(t,y,w)-f(t,y)  +|  (K(t,8,y(8,w),u;)-k(t,8,y(8,w))]d8.  (3.1) 

*0 

Proof:  From  (2.1)  and  (2.3),  8y8tem  (2.1)  can  be  rewritten  as 

t 

y'  =  f(t,y)  +  I  k(t,8,y(8,w))ds  +  R(t,y,Ty,w)  (3.2) 

*0 

where  R(t,y,Ty,w)  is  as  defined  in  (3.1). 

Let  x(t,s,y(s,w))  and  x(t,a;)  =  x(t,tg,yQ(u;))  be  the  sample  solution  processes  of  (2.3)  through 
(s,y(s,w))  and  (to)yo(‘^))t  respectively,  and  y(s,a))  =  y(s,tQ,yo(w))  be  the  sample  solution  process  of 
(2.1)  through  (to,yo(u;)).  Now  we  compute  the  total  sample  derivative  of  V(s,x(t,s,y(s,(j)),u>)  with 
respect  to  s  as 

^V(s,x(t,s,y(s,w)),a/)  =  Vs(s,x(t,s,y(s,u;)),w)+Vx(s,x(t,s,y(s,w)),w){^x(t,8,y(8,w))] 

=  Vs(s,x(t,s,y(s,w)),w)+Vx(s,x(tAy(s,w)),w)|-<>(t,s,y(s,w))f(5,y(s,w)) 

t  s 

-|L(t,<r;s,y(s,w))k((T,s,y(s,w))d<r+«(t,s,y(s,w))(f(8,y(s,u;))+|  k(s,^,y(^,w)d^ 

®  to 

+R(8,y(s,u;),Ty(s,w),u;)))| 

t 

=  Vs(s,x(t,s,y(s,w)),u;)+Vx(8,x(t,s,y(s,w)),a;)  |-|L(t,(r;s,y(s,w))k(<T,s,y(s,w))d<r 

s 

s 

+<>(t,s,y(s,w))(|  k(s,€,y(^,w)d^  +  R(s,y(s,w),Ty(s,w),w)))|  w.p.l  (3.3) 

to 

Integrating  in  the  sample  sense  both  sides  of  (3.3)  from  tg  to  t,  and  noting  x(t,t,y(t,tg,yg(a;),b;))  = 
y(t,tg,yg(a;),w),  we  obtain, 

t 

V(t,y(t,w),w)  =  V(tg,x(t,u;),a;)+  |  |Vs(s,x(t,8,y(8,w)),w) 

to 

+Vx(8,x(t,s,y(s,w)),c.;)*(t,s,y(s,w))R(8,y(s,u;),Ty(s,a;),w)}ds 
t  t 

Vx(s,x(t,s,y(s,w)),fa;)|  L(t,<r;s,y(8,u;))k((r,s,y(s,w))d(rds 
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(3.4) 


+  I  Vx(s,x(t,8,y(8,w)),w)*(M,y(s,w))|  K(8,(,y((,w))<I(<ls 

U8ing  Fubini’s  Theorem  the  last  term  in  (3.4)  can  be  written  as 
t  8 

1 1  Vx(s,x(t,s,y(8,w)),w)<t(t,s,y(s,w))k(8,(,y(^,w))d(ds. 

to^o 


t  8 


=  I  j  Vx(<r,x(t,<r,y(«r,w)),w)*(t,rr,y(<r,<j))k(<r,s,y(s,u>))dirds. 

to^o 

Using  (3.5),  (3.4)  can  be  rearranged  as  (3.1)  and  hence  the  theorem  follows. 


(3.5) 


Corollary  3.1:  Let  the  assumptions  of  Theorem  3.1  holds.  If  V(t,x)  =  x  with  n=m,  then 
reduces  to 

t 

y(t,w)=x(t,u;)+j  <t(t,s,y(s,w))R(s,y(s,w),Ty(s,tj),w))ds 


t  t 

+  I  ||<I'(t,<r,y(<r,w))-L(t,s,y(s,u;))|  k(r,s,y(s,w))d<rds. 


to® 

Remark  3.1:  If  f(t,y,w)  =  A(t,w)y,  K(t,s,y,w)  =  a(t,s,u;)y,  then  (3.6)  reduces  to 

t 

y(t,u;)=x(t,w)+|  <I'(t,s)R(s,y(s,w),Ty(s,u;),w))ds 

Corollary  3.2:  If  V(t,,\,u;)  =  ||x||^  then  (3./)  in  Theorem  3.1  reduces  to 

t 

||y(t,w)||^=  ||x(t,w)||2  +  2  j  x'’’(t,s,y(s,u;))<t(t,s,y(s,u;))R(s,y(s,u;),Ty(s,u;),u;)ds 


(3.6) 


(3.7) 


t  t 

+2|  j  |x^(t,(T,y{<r,u;))<I'(t,<T,y(<r,w)) 

to®  -x^(t,s,y(s,w))L(t,<r;s,y(s,w))|k(or,s,y(s,w))d<rds.  (3.8) 

Theorem  3.2:  Suppose  all  the  hypotheses  of  Theorem  3.1  hold.  Then, 

t 

V(t,y(t,u;)-x(t),w)  =  V(t(,,x(t,(.;)-x(t),w)+  j  |Vs(s,x(t,s,y(s,w)-x(t,s,x(s)),w)) 

to 

+Vx(s,x(t,s,y(s,(.;))-x(tAx(8),w)*(t,s,y(8,w))R(8,y(s,w),Ty(s,u;),t.;)}ds 
t  t 

+1  j{Vx(<r,x(t,<r,y(o,w))-x(t,o,x(<r)),u;))*(t,<r,y(<r,w)) 
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-Vx(s,x(t,s,y(s,w))-x(t,s,x(s)),w))L(t,s,y(s,w))|k((T,s,y(s,w))d<Tds. 


(3.9) 


where  x(t)  =  x(t,tQ,ZQ'  is  the  solution  process  of  either  (2.2)  or  (2.3)  depending  on  the  choice  of  Zg. 

Proof:  By  following  the  proof  of  Theorem  3.1,  we  have  the  relation, 

^V(s,x(t,s,y(s,u;)-x(t,s,x(s)),w))  =  Va(s,x(t,s,y(s,w)-x(t,s,x(s)),w)) 

+Vx(s,x(t,s,y(s,w))-x(t,s,x(s),w)  ^x(t,s,y(s,a;))-x(t,s,x(s) 
Using  this  and  following  the  steps  of  Theorem  3.1,  the  theorem  can  be  easily  obtained. 

Corollary  3.3:  If  V(t,x,a;)  =  ||x||^,  then  (3.9)  in  Theorem  3.2  reduces  to 

l|x(t,t(j,yo(u;))-x(t,to,Zo)j|^ 

t 

+  2  I  (x(t,s,y(s,w))-x(t,s,x(s))’^*(t,s,y(s,w))R(s,y(s,w),Ty(s,w),w)ds 
•■0 
t  t 

+2|  |(x(t,<T,y(<T,w))-x(t,<T,x(<r)))'^<i>(t,s,y(s,w)) 

-(x(t,s,y(s,w))-x(t,s,x(s)))’^L(t,tr;s,y(s,t.;))]k(<r,8,y(s,w))dods.  (3.10) 

We  recall  that  x(t)  =  x(t,tg,ZQ)  is  the  solution  process  of  either  (2.2)  or  (2.3)  depending  on  the  choice 
of  Zg.  In  other  words  x(t)  is  either  m(t)  =  m(t,tg,Zg)  =  x(t,tg,mg)  or  x(t,tg,xg(w)). 

4.  Stability  Analysis 

By  employing  the  preceding  results,  we  give  sufficient  conditions  for  the  p-th  moment  stability  [4,8,9] 
of  the  trivial  solution  process  of  (2.1). 

Theorem  4.1:  Let  the  hypotheses  (Hj),  (H2)  be  satisfied,  and  y(t,u;)  =  y(t,tg,yg(a;))  and  x(t,w)  = 

x(t,tg,Xg(w),u;)  be  the  sample  solution  processes  of  (2.1)  and  (2.3)  existing  for  t>tg  with  Xg(w)= 
yg(w),  Vx(t,x,w)  exists  and  is  sample  continuous  for  (t,y)  €R^.xR".  Furthermore,  V(t,x),  x(t,a;), 
$(t,s,y(s,w)),R(s,y(s,w),Ty(s,w),a;),f(s,y(s,w)),k(t,s,y(s,w))  satisfy 

m 

(i)  bdlxjl**)  <  ^2  I  ^i(*'t*)l  ^  ®(  ll^^lr  )  fo''  (t»*)  €  R.J.X  R"  where  p>  1,  bel^’K  and  a€  C3G; 

i=l 

(ii)  f(t,0)=  0,  k(t,s,0)  =  0  with  probability  1  for  all  s<t  G  R"*^; 
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s 

(iii)  ||'®V(s,  x(t,8,y),w)|l  <  Aj(t-8,w)l|V(8,y(8,w))ll+  |  A2(s,w)R(t,T)||V(T,y(T,w))||dT 

‘o 

where  Iq  <r<  s  <  t,  and  ||y||^  <  />,  where  p  is  some  positive  real  number, 

^V(8,x(t,s,y))  =  VgCs,  x(t,s,y))  +  Vx(8,x(t,8,y),w)|*(t,8,y(s,w),u>)R(s,y(8,w),Ty(s,w),w) 

8  t 

+*(t,s,y(s,w),w)| K(8,^,y((,w))d^- jL(t,<r;8,y(8,w))K(<r,s,y(8,w),u;)dsJ  (4.1) 
‘o  • 

x(ttS.y)  is  tbe  solution  process  of  (2.3)  through  (s,  y),  p>0  and  A€C[  R^,  R^J  n  L^(R^,  R^]  defined 

s 

by  A(s,w)  =  Aj(s,(t;)  +  J  A2(s,a;)K(8,r,(j)dr,  A^and  p  €  C[  R^,  R^]  H  L^(R^,  R^J  and  p  is 
t  ‘o  t 

defined  by  p  =  J  H^(t,s,(j)ds  with  H(t,s,(t;)  =  A£(s,(«')  +  jR(t,s,w)A2(r,u;)dr  such  that 
to  s 

®  R.j.xR^  and  A(s,u)  satisfies  the  relation 

1 

(4.2) 


’  oo  n 

"t  1 

E  exp(  A(s,to,w)ds] 

<ex|: 

1  A(s)ds 

L  0 

to 

(iv)  II  V(to,x(t,w),w)||  <  a(  II  yo(w)||'^).  whenever  E[  ||yol|*^]  <  P  for  some  p  >  0, 

where  a  €  C3G.  Then  the  trivial  solution  process  of  (2.1)  is  stable  in  the  p-th  moment. 

Theorem  4.2.  Assume  that  the  hypotheses  of  Theorem  4.1  hold  except  that  (iii)  and  (iv)  are 

replaced  by 

(iii)  l|^V(s,x(t,s,y),w)||  <  Ai(s,w)r;^(t-s,u;)||V(s,y,c.;)|| 

s 

+A2(s,w)i72(t-s,w)|  i73(s-r,w)||V(r,y(r,w),u;)||  for  to<  s<t, 

to 

(iv)  l|V(to,x(t,w),u;)||  <  o(  II  yo(‘v)||*’)/?(t-to),  t>  tg, 

provided  E[||yQ(w)||**]  <  p  where  o  €CK  ;  and  satisfies 

’ll(t*s,)^(s-to)  <  K^(t-tQ' 

*?2(t‘^)’/3(’’*s)  <  Kijj(t-8)A(t'8)  for  some  A  €l.^nC[R^.,R^l  and 

tSSof I  A(8,to)dsJ 


=  0 


(4.3) 
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where  A(8,tQ,w)  =  KAi(s,w)  +  k|  A2(s,w)A(s-{,w)d^.  Then  the  trivial  solution  process  is 

^0 

asymptotically  stable  in  the  p-th  mean. 

5.  Error  Estimate  Results. 

We  present  a  few  error  estimate  results  by  employing  the  method  of  variation  of  parameters  with 

regard  to  systems  of  integro-differential  equations  with  random  coefficients. 

Theorem  5.1.  Let  the  assumptions  of  Theorem  3.2  be  satisfied.  Further  assume  that 
■  m 

(0  b(|lx||*^)  <  2  I  Vj(t,x,w)|  <  a(l|x||*^), 

i=l 
m 

(ii)^lDVj(s,x(t,s,y)-x(t,s,z),a;)l<Aj(t-s)C(lly-2ll)+A2(t-s)C(llzll) 

i=l 

s 

+1  A3(t-s)a(t,r)C(||y(T)-z(T)||'’)dr 
^0 

where  to<r<8<t,  a€5b  and  it  is  differentiable,  b€  1^*,  C  €  3G,  and  DVj(8,  x(t,s,y)-x(t,s,z),fa;)  is 
the  i-th  component  of 
DV(8,  x(t,s,y)-x(t,s,z),w) 

=Vs(s,  x(t,8,y)-x(t,8,z),w)  +Vx(s,x(t,s,y(s,w))-x(t,s,x(s)))|*(t,8,y(w))R(8,y,Ty,w) 
s  t 

+  *(t,s,y(s,u;))|  K(s,^,y(^,w))d^  -J  L(t,<r;s,y(s,w))K(<T,s,y(8,w))d8}  (5.1) 

‘O  *  1 

p>l,  Aj.Aj.AjC  C[R^,R4.]  n  L^[R‘^,R'^].  Let  us  define  H(s),  ^  H  =^.  Ms)  =  C((b*^(8))^) 
and  assume  that  H  €  X. 

Then, 

E[|(y(t.a;)-x(t)f]  <  b-i{r(t,w)]  (5.2) 

for  t  >  tQ,  where  y(t,w)  and  x(t,w)  are  the  solution  process  of  (2.1)  and  (2.3)  through  (to,yo(w)),  and 
x(t)  =  x(t,tQ,Zg)  is  the  solution  process  of  (2.2)  or  (2.3)  depending  on  the  choice  of  Zg:  0{s,u)  is  the 
absolute  value  of  the  sum  of  A2(0,u;)C|Ix(s)||*’  and  the  time  derivative  of  a  (l(x(t,w)-x(t)||**);  r(t,u;) 
is  the  maximal  solution  process  of  the  integro-diferential  equation 

t 

m'(t,u;)  =  0{t,w)  +  Ai(0)  h(m(t,w))  +  j  A3(0,w)  a(t,T)h(m(s,w))ds.  (5.3) 

^0 

Corollary  5.1:  Suppose  that  all  the  hypotheses  of  Theorem  5.1  are  satisfied  except  the  differentiability 
of  a  and  assumption  (ii)  are  replaced  by 

li«(t,s,y)||  <  K  for  tg<8<t,  y€R",  (5-4) 

and  assumption  (ii)  is  valid  whenever(5.4)  holds,  where  K  is  a  posive  real  number.  Then  (5.2) 
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reduces  to 


E(||y(t,w)-jt(t)||Pl  <  b-*(r(t.u,)] 

where  r(t,w)  is  the  maximal  solution  process  of  the  integro-differential  equation 

t 

m'(t,u.)=A2(0,u;)C(||x(t)||)+Ai{0)h(m(t.w))+j  A3(0,u;)a(t,r)h(m(s,w))ds,  t>to  (5.5) 

‘o 

The  details  of  the  proofs  of  these  presented  results  will  appear  elsewhere. 
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Abstrsct:— Wc  considered  several  population  models  such  as  exponential  growth,  logistic  growth  and 
competetion  models  with  random  coefficients  and  random  initial  conditions  as  random  parameters,  if,  otw  or 
more  of  the  initial  conditions  are  degenerate  constants,  then  Liouville's  theorem,  which  describes  the 
evolution  of  the  Jacobian  of  the  mapping  is  no  longer  applicable.  Bated  on  a  nunterical  technique  developed 
by  Bellomo  and  Pistone  [2],  we  obtained  joint  probability  density  function  for  the  dependent  variables  or  the 
marginal  probability  density  function  for  the  individual  dependent  variables.  Nuntterical  methods  are  slab 
explored  in  these  cases. 

1.  INTRODUCTION 

Modern  population  biology  is  based  on  the  fundamental  models  such  as  exponential  growth,  logistic 
growth,  and  competition  models.  Yet,  their  adequacy  is  questionable,  in  part,  due  to  their 
deterministic  properties.  There  has  been  a  number  of  studies  based  on  these  models  and  treated  the 
coefTicients  or  initial  conditions  to  reflect  random  environmental  fluctuations  [5].  Our  analysis  is  based 
upon  treating  one  or  more,  and  perhaps  all,  of  the  initial  data  to  be  deterministic.  In  this  approach, 
Liouville’s  theorem,  which  describes  the  evolution  of  the  Jacobian  of  the  mapping  is  no  longer 
applicable.  A  more  generalized  valid  technique,  utilizing  standard  numerical  methods  for  solving 
differential  equations  has  been  developed  by  Bellomo  and  Pistone  [2]  and  utilized  by  Harlow  and  Delph 
[3]  in  solving  differential  equations  numerically.  This  technique  yields  Liouville’s  theorem  as  a  special 
case  and  utilizes  the  direct  mapping  between  the  nondegenerate  random  variables  and  the  space  of 
dependent  variables.  We  have  utilized  this  technique,  in  obtaining  the  joint  density  or  the  marginal 
density  of  the  individual  populations.  The  first  and  second  moments  are  obtained  to  study  the 
qualitative  properties  of  populations.  Standard  numerical  techniques  are  used  for  various  cases. 


The  research  reported  herein  was  supported  by  the  Army  Research  Office  Grant  No.  DAAH-04-93-G- 
0024  and  the  National  Security  Agency  Grant  No.  MDA-904-93-H-2002. 

This  oaoer  was  prosentec^  at  the  Tenth  Conference  in  this  series. 


2.  PRELIMINARIES 


Consider  the  random  diflerential  equation, 


where  X=(Xj,X2,X3,.MXn)'^,  g=(gi«g2>— Mgn)^  •  and  T  denotes  transpose.  The  parameters 
Xq=  (Xjq,  X2q,...,Xjjq)  with  joint  probability  density  function  (Aj,A2i—Ain)  with 

joint  probability  density  fy^(A)  are  assumed  to  be  random.  A  and  Xg  are  assumed  to  be  independent. 
We  would  like  to  consider  the  broader  class  of  problems  in  which  at  least  part  of  Xg  is  degenerate. 

We  need  the  following  results  in  our  subsequent  analysis. 

Lemma  2.1  [3]  Assume  that  (2.1)  has  a  unique  solution  h(t,tg,Xg)  on  [  a,  b].  Let  fx  and  are 
continuous  on  the  domain  D(t,X,A)  of  (t,X,A)  of  dimension(n+m+l).  Then  the  solution  is 
continuously  differentiable  on  {  a<t<b;  U}  where  U=  {  a<tg<b;  |Xg— ♦(tg)|  +  jA  — Ag|  <  tf}. 
Furthermore,  =  Y  is  the  solution  of  the  initial  value  problem 

^  =  fx(t,h(t,tg,Xg,A),A)Y  +  ^(t,h(t.tg,Xg,A),A);  Y(tg)  =  0  (2.2) 

Similarly,  =  Z  satisfies 


^=g(t,X,A);  X(tg)  =  Xg 


(2.1) 


—  lx(^'^(^i^0»Xg,A),A^  Z;  Z(tg)  —  Cj 


(2.3) 


where  ej  is  the  vector  ej=  (0,0,0,..., 1,0, 0.., 0)  ^ . 


Lemma  2.2  [4]  Let  Xgg  represent  the  i mdegenarate  probability  subspace  contained  in  Xg,  and  the 
dim(XQg)  =  JL  <  n,  where  n  =  dim  Xg.  We  consider  the  following  three  different  cases. 

Case(i);  We  consider  the  situation,  in  which  0<JL<n,  l<..4b<  n,  and  i.+-4t=  n.  In  this  case, 
dim[  Xgg,A]=  dim  X.  Therefore,  the  mapping  (Xqq,A)— ►X=h(t,tg,XQg,A)  is  well-defined.  If  the 
mapping  is  assumed  to  be  continuous,  and  well-defined.  If  the  mapping  is  assumed  to  be  continuous, 
and  one-one,  then  ^(t,tg,X)  =  (Xgg,  A)  e.xists,  and  the  Jacobian  J(t,tg,X)  of  the  inverse  mapping, 
given  by,  h~^;  X— »(Xg,A)  and 

QL  —  1 

J(t,tg,X)=  1^1  =  l-^l  (2.4) 

The  probability  density  function  for  (Xgg, A)  to  that  of  X  is  given  by, 

~  ^AjXgg^^’^O’  ^  (2-5) 
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Ca8e(ii):  We  consider  the  situation,  in  which  0<l<n,  1<  Jl><n,  and  JL+^>n.  Let  q=  JL+^n. 
In  order  to  obtain  an  invertible  mapping,  we  carryout  a  standard  augmentation  of  the  probability 
space  of  X.  To  be  specific,  let  us  assume,  without  loss  of  generality,  that  q<w4b,  and  define 

®1  =  ^^q+1*  ®2=  ^Ji-q+2'  ^jfc-q+3 . 

We  now  consider  the  mapping  (Xqq,  A)-*(X,  B)=  ^h(t,tQ,XQ,A),  B(A)^;  which  is  well-defined. 

The  Jacobian  of  the  mapping  is 

K(M„,A,X^)  =  I  (2.6) 

We  assume  thai  Che  mapping  is  invertible  in  the  form  (Xqq,A)  =  ^h~^(t,tQ,X,B),  A(B)^,  and  hence, 

the  Jacobian  of  the  inverse  mapping  satisfies 

Jft  t  X  B)  —  _ _ 1 _ 

J(t,t(,,X,B)  -  I  gj  I  _  K(t,tjj.(A,XoQ)) 

=  (h-l(t,to,X,B),A)  (2.7) 

We  can  compute  the  joint  probability  density  function  on  f^  g  from  the  relation 


—  ^A,X0q(^’^O’^  ^(MQ»X.b),A(b)^|J(t,tQ,X,b)| 


(2.8) 


and  the  jpdf  for  X  may  be  obtained  from  equation  (2.8)  by 


tx(t,t|,,X)  =  f....  |l 


g^t,tQ,X,b^db^db2db^..dbf^ 


(2.9) 


we  will  consider  the  case  L-l- Jl><n,  the  case  for  which  the  sum  of  the  dimension  of  the  nondegenerate 
initial  conditions  and  the  dimension  of  the  random  parameters  A  is  strictly  less  than  the  dimension  of 
X.  In  contrast  to  the  first  two  cases  of  consideration  the  inverse  transformation  X— ♦(Xqq,A)  is  not 
well-defined  and  hence  the  Jacobian  J  does  not  exist.  The  joint  probability  density  function  f^  may  be 
either  zero  or  does  not  exist.  Nevertheless  some,  and  possibly  all  the  marginal  pdfs  may  be 
constructed  by  means  of  the  following  procedure. 


Case(iii):  Let  0  <Jl<n,  l<.A<n,  and  JL-i-.A<n.  We  will  find  the  prf  for  the  variable 

Xj=b.(t,tQ,A,XQ),  l<i<n.  We  will  assume  that  the  solution  for  Xj  is  invertible  in  one  of  the  random 
parameters  (A,  Xqq).  Without  loss  of  generality,  assume  that  this  parameter  is  A|,  so  that  we  may 
write 


Aj=hi 


-1 


(*’*0’^i’^00r^X002’""’^001’ 


We  perform  the  following  augmentation,  and  define 


(2.10) 


®l-‘^2’ . ’  >  ^001 


(2.11) 
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By  virtue  of  equation  (2.11)  the  direct  mapping  (  A,Xqq)-»(X|,B)  exists,  and  the  Jacobian  has  the 
simple  form, 

K(t,to,A,Xo)  =  ^  (2.12) 

and  hence  the  Jacobian  for  the  inverse  mapping  satisfied, 

J(litO’^i’®)  ~ 

The  joint  pdf  is  given  by, 

for  which  the  marginal  pdf  for  X|  is  given  by 

Now  we  are  in  a  position  to  discuss  the  main  results. 


3.  MAIN  RESULTS 

We  will  discuss  the  basic  biological  models  such  as  exponential  growth,  logistic  model  and  competition 
model  treating  parameters  as  random. 

3.1  EXPONENTIAL  GROWTH  MODEL: 


Consider  the  exponential  growth  model  of  the  form, 

^  =  rN.  N(0)  =  N^ 


(3.1.1) 


Here,  r  and  Nq  can  be  treated  as  random  parameters. 

1.  RANDOM  INTRINSIC  GROWTH  RATE: 

Assume  that  the  initial  condition  Nq  is  deterministic  constant  and  r  is  a  random  variable  and 

n 

independent  of  time  t.  Let  r~  Normal(  f,<r^). 

Then,  fr(r)  =  exp|-i^!^)2|,  -oo<r<oo 

our  aim  is  to  study  the  mean  and  variance  of  In^.  That  is,  Ej^ln^j  andvj^lnjJLj. 


the  solution  of  (3.1)  is  given  by,  N(t)  =  NqC^^ 


(3.1.2) 


The  space  (XQg,A)  =  r  and  X=:  N.  Therefore,  the  Jacobian  of  the  mapping  J:  (  Xqq.A)— ♦X  is 
denned  and  is  given  by,  |  — jl  =  ^ 
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Therefore,  fj^(N)  =  -oo<N<oo 

Let,  Z  =  1  f  ~  N(0,1)  and  we  obUdn,  In  =  t<rZ  +  tf. 

«  l  Nq  Wq 

Therefore,  ij^ln  =  tf  and  v|^  In  ]^j~ 


(3.1.3) 


(3.1.4) 


If  we  assume  that  the  initial  condition  Nq  is  a  deterministic  constant,  and  r  is  a  random  variable 
follows  Unif(l,2)  and  independent  of  time.  That  is  r'^Unif(l,2). 

Then,  fN(N)  =  ^  ;  1<  {in^  <  2  (3.1.5) 


Now, 


*{'“  1^]  ^  f  A ~  4'"  &  =  I  ('“  ifc  =  5‘^' 

N„«‘  N„.‘ 


and  hence  we  obtain, 


(3.1.6) 


2.  RANDOM  INITIAL  CONDITION; 

Assume  that  r  is  a  deterministic  constant,  and  Nq  is  a  random  variable  and  follows  exp(A)  and 
independent  of  time  t. 


That  is,  Nq~  exp(A),  and  hence  fj^(N)  =  e  ^*.Ae“'^^*  ,  N>0. 


Therefore,  E(  N)  =  ie'‘  and  V[N]  =  ^  e^^* 


(3.1.7) 


2 

If  r  is  a  deterministic  constant  and  Nq  is  a  random  variable  follows  N(/i,  a  )  then,  we  obtain 
E[N]  =  /je'S  and  V(N]  = 


3.  RANDOM  INTRINSIC  GROWTH  RATE  AND  RANDOM  INITIAL  CONDITION: 

Assume  that  both  r  and  Nq  are  random  variables  and  independent  of  time.  In  this  case  the  Liouville's 
theorem  fails  and  we  use  case  (ii)  of  Lemma  2.2.  Here,  X=  (N,r),  and  the  Jacobian  of  the  direct 
mapping  (Xqq,A)— ►X=(N,r)  is  given  by,  K=|^^|  =  e^*.  If  Nq,  r  are  independently  UniformIy(  1,2) 
Then, 

fj^,r(t,N,r)  =  e“'‘,  1<  r<  2,  1<  N“'*<  2. 


Then, 


2  e'‘ 

ElN]  =11  Ne-^‘dN  =  l(e2‘_e*),  =  fj(e'*‘-e2‘). 


(3.1.8) 


703 


3.2  LOGISTIC  GROWTH  MODEL 
Consider  the  Logistic-Growth  model, 

^  =  rN(l-g),  N(0)  =  No>0 
Here  r,  K  and  Nq  can  be  treated  as  random  parameters. 

1.  RANDOM  INTRINSIC  GROWTH  RATE: 

Assume  that  r  is  a  random  variable  while  K  and  Nq  are  deterministic  constants. 

A 

Let,  r~Normal  (f,  <r  ). 

Then,  (XgQ,A)  =  r— »X=N  is  well-defined  and  hence  the  Jacobian  of  the  direct  mapping  =  |^| 
The  solution  of  (3.2.1)  is  given  by, 

KNge'^  (K-Nn)KNne^‘ 

N  =  - - - ^  and  Jacobian  of  the  direct  mapping  = - =-7 — is — xt 

K-N,+N,.'‘  (K+N„.'‘-N„)2 


(3.2.1) 


Therefore,  f]i4(N)  =  ff(r  in  terms  of  N,  Nq  and  K). 


(K-t-Nge^^-Nfl)^ 


(K-t-Nge^-Ng)^ 


<r(K-N)Nt 


«  (K-Ng)KNge^‘t 

-  1  K  _ lF  l|n/  N  .I2I 

■  <r(K-N)Nt  ‘  K  (K-N)  V  J 

Take.  Z=  tg‘"{i^^(K-N)^~0’  therefore  In  +  ft,  which 


If  r  is  uniform  random  variable  ~  Unif(l,2)  while  K  and  Ng  are  deterministic  constants. 


Then, 


Then, 


%(N)  -  fr^  I  ln{  (K-N)Nt 

=  (K-N)Nf  ‘  <  I  '"  { iAi  2 

/in  / -N  _  3t  j  vF,  f  N  _  1  ^2 

INg  (K-N)  /J  “  2‘  Y  (K-N)  /J  "  12^  • 


2.  RANDOM  CARRYING  CAPACITY: 


Assume  that  K  is  a  random  variable,  r  and  Ng  are  deterministic  constants. 

Let  K~  Unif(l,2),  then  ,  ,  , 

Nn(e'‘-l)e'‘  NNn(l-e^‘) 

f«(t,N)  =  jV.  1<_0L_^<2 


(Nge'‘-N)2 


N-N„e^ 


(3.2.2) 


(3.2.3) 


(3.2.4) 


(3.2.5) 


We  can  obtain  the  mean  and  variance  as  follows: 
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(3.2.6) 


EtN(t)]  =  |  +  N/‘ 


an<.V(N(t)l= 

-NQ^(e'‘-l)V"‘lnl  > 

O'  NQ(ef‘-l)  +  2 

3.  RANDOM  INTRINSIC  GROWTH  AND  CARRYING  CAPACITY: 

Assume  that  r  and  K  are  random  variables  and  Ng  is  a  deterministic  constant.  In  this  case, 

(Xgg.A)  =  (r,K),  X=  N.  Therefore,  the  Liouville's  theorem  fails  and  we  will  consider  the  case  (ii)  of 
our  Lemma  2.2  and  augment  the  space  X.  Consider  the  mapping  (Xgg,A)  =  (r,K)— ►X  =  (N,r)  is  well- 
defined  and  the  Jacobian  of  the  direct  mapping. 


K  = 


_ ,  axT . 


^^-1  =  — ^5 — - j,  and  hence  the  joint  density  of  N,  r  are  given  by 


NgV^-^^ 

(Nge'‘-N)^ 


(3.2.7) 


Assume  that  r  and  K  -i.i.d.  with  Unif(l,2).  Then,  the  joint  density  of  (N,r)  given  by  (3.2.7)  reduces 


The  marginal  density  fj^(t,N)  can  be  obtained  by  integrating  (3.2.8)  with  respect  to  r. 


(3.2.8) 


4.  RANDOM  INTRINSIC  GROWTH,  CARRYING  CAPACITY  AND  INITIAL  CONDITION: 

If  the  parameters  r,  K,  and  Ng  are  random  variables,  i.i.d.  Unif(l,2).  Here  Liouville’s  theorem  fails 

and  we  need  use  :ase  (ii)  of  the  Lemma  2.2.  We  obtain,  the  joint  density  function 

follows: 


- 5,  1  <  r  <  2,  l<  K<  2, 


f  „(t,r,K,N)  = - - 9 

r.K,N  j  (K-N)e^‘-HN)2 


Ke*^^  <  <  2Ke‘ 


rt 


(K-l)+e 


Tt 


(K-2)+2e' 


rt 


(3.2.9) 


The  marginal  density  ffj(t,N)  can  be  obtwned  by  integrating  (3.2.9)  with  respect  to  rand  K. 

.3  3  COMPETING  SPECIES  MODEL 

Consider  the  competing  species  model  ^ 

V  =  Xi(.i-V  Vj).l=l,2 
J=1 


(3.3.1) 
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where  x-  is  the  population  density  of  the  i-th  species  in  the  community.  Let  a^>0,  or2>0  be  the 
equilibrium  states. 

Then,  (3.3.1)  becomes, 

wj'  =  -bijwjoj-^  b..(aj+w.)w.,  i=l,2  (3.3.2) 

j=l 


The  total  number  of  parameters  are  b^^,  b^2<  ^21*^22* 

deterministic  constants,  constants,  and  b^^,  b^2>  ^21*^22  random.  In  this  case 

(XQ0,A)=(bjp  b22.  b2j,b22).  X=  (wj,W2)  therefore,  Liouvill’s  theorem  fails  and  we  use  case  (ii)  of 
Lemma  2.2  and  we  extend  the  space  X  as  X=  (wj,W2,  ^J2’^2l)* 


Therefore,  the  mapping  (Xq^,  A)  =  (b^j,b|2<b2iib22)— »(W},W2,bj2ib2j)  is  well-defined  and  the 
Jacobian  of  the  mapping 


K  -  _ 

-  d(XQQ,A)  - 


det 


•11 

*12 

*13 

*14 

*21 

*22 

*23 

*24 

*31 

*32 

*33 

*34 

*41 

*42 

*43 

*44 

(3.3.3) 


dw.  9w2  ^^12  ^^21  ^'^2 

where  ^  ,  aj2=  ^  ^  ,  a^^  =  ^  ,  821=  ,  822  =  ^ 

5bj9  3b2|  9wj  3w2  5b22  ^b2j  5b2j 

*23  =  ^  ’  *24  =  ab^  ’  *31  =  ^  ’  *32=  ^  ’  *33  =  ^  ’  *34  =  ^  ’  *41  = 

,  _  ^‘’21 

42  ab22  ’  ab22  ’  44  ab22 

dvf„  aw,  aw,  aw„ 

By  expanding  the  determinant,  K  =  57—^  or  *■  —  gr— ■■  ac  ’ • 

abu  ab22  abjj  db22 


In  order  to  obtain  the  Jacobian,  let's  use  Lemma  2.1.  Let’s  take  w(t)  =  h(t,tQ,XQ,A)  be  the  solution 
process  of  (3.3.2). 


Then,  =  Y  is  the  solution  of  the  I.V.P., 

abjj 

^=gw(t,w(t),A)Y-^^i^^^J^^  y(t(,)  =  0  (3.3.4) 

Similarly,  =  Z  satisfies, 

°'"iQ 

g  =  gw(t,w(t),A)Z,  Z(to)  =  ej  (3.3.5) 

From  (3.3.2),  (3.3.4)  can  be  obtained  as  follows: 
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-bjj(oj+2wj)-bj2W2 


-b,2(a,+wj)  -|Y+|l(t,w(t).A),  Y(t(,) 

~*’22<“2+  2w2)-4i*iJ 


=  0 

(3.3.6) 


-hm  ^(t,.(t),A)=  [  -(«l+*l)*2 


0 

-(o2+W2)wi 


We  can  solve  (3.3.6)  to  obtain  the  Jacobian  K. 

Then,  the  Joint  density  of  w^,  W2,  b^2<  ^21  ^  obtained  by  using, 

fwj,W2,b2i,b22^”l’''2’*’21‘**22)  =  fbu,bj2,b2i,b22^*’ll’‘*12’‘'21'‘»22)'K 

Individual  or  marginal  densities  can  be  obtained  by  integrating  (3.3.7). 


NUMERICAL  METHODS 


(3.3.7) 


In  this  section,  we  will  illustrate  the  analytical  results  obtained  in  the  previous  sections  along  with 
standard  numerical  methods  to  obtain  the  joint  probability  density  functions  and  the  marginal 
probability  density  functions  of  the  solutions  of  exponential,  logistic  and  competing  species  models 
involving  random  parameters  and  random  initial  conditions. 

The  numerical  technique  we  have  adapted  is  somewhat  similar  to  [2],  for  the  case  in  which  the 
initial  conditions  are  completely  random.  The  method  involves  numerical  computation  of  the  inverse 
mapping  X— »Xq  and  using  standard  numerical  quadrature  methods  in  order  to  obtain  marginal 
probability  density  functions. 


Example4.1.  Consider  the  exponential  growth  model, 

^  =  rN,  N(0)  =  Nq  (4.1) 

Assume  that  the  parameters  Ng,  r  are  random  variables  Uniformly  distributed  on  the  interval  (1,2). 
The  solution  of  (4.1)  can  be  easily  obtained  as, 

N(t)  =  Nge'S  t>0 

and  the  joint  probability  probability  density  function  of  (N,r)  are  given  by, 

fN,r(».N,r)  =  K  r<  2,  1<  Ne-"*<  2  (4.2) 

The  marginal  probability  density  function  N  in  (4.2)  can  be  obtained  as  follows: 
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fj^(t.N)  = 


C  e‘<N<e2t 

/  Ne“‘-Ne-2t],  ^21^  2e‘ 

\  ^(2-Ne“2t],  2e‘<N<2e2‘  (4.3) 

for  t<  ln2,  and  for  t  >  ln2 

r  e‘<N<2e‘ 

fN(‘’N)  =  ^  t}}.  2e‘<  N<  e2t 

2-Ne“2t]^  e2‘<N<2e2‘  (4.4) 

and  fji^(t,N)  =0  outside  the  indicated  region. 

The  Jacobian  J  of  the  inverse  mapping  N— ►Nq  is  governed  by, 

^  =  -rJ,  J(0)  =  1  (4.5) 

In  the  numerical  scheme  proposed  in  [2]  a  value  of  r  is  chosen  in  the  interval  (1,2),  along  with  a 
value  of  N.  Then,  (4.1)  is  integrated  backwards  in  time  from  a  given  value  of  t  to  determine  the 
corresponding  value  of  Nq.  This  species  the  inverse  mapping.  Then,  (4.5)  is  integrated  forwards  in 
time  using  the  calculated  time  history  N(t)  to  obtain  J.  This  procedure  will  leads  to  the  joint  pdf 
%  This  Joint  pdf  will  be  numerically  integrated  with  respect  to  r  to  obtain  the  marginal  pdf. 

The  results  are  shown  in  Fig.  1.  Again  the  numerical  solution  agrees  with  the  exact  solution  to  within 
the  resolution  of  the  plot. 

When  Nq  is  a  deterministic  constant,  the  pdf  fp^(t,N)  can  be  found  to  be 

.  Nqc‘  <  N  <  NQe2‘  (4.6) 

and  identically  equal  to  zero  outside  the  indicated  region. 

Now  the  inverse  mapping  is  the  mapping  r— ♦N.  For  a  given  value  of  t,  this  mapping  may  be 
calculated  by  a  simple  numerical  shooting  procedure  which  involves  choosing  a  value  of  N,  and  then 
varying  the  value  of  r  systematically  so  as  to  arrive  at  the  chosen  value  of  N  when  (4.1)  is  integrated 
forward  in  time.  The  Jacobian  K  of  the  mapping  r— »N  is  just  K  =  ^  =  Y,  and  from  (2.2) 

^  =  rY+  N,  Y(0)  =  0  (4.7) 

In  the  numerical  scheme,  given  the  value  of  r  from  the  shooting  procedure  and  the  time  history  N(t), 
(4.7)  can  then  be  numerically  integrated  to  obUun  Y.  The  Jacobian  of  the  inversae  mapping  N  — »r  is 
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given  by  J  =  1/K=1/Y.  An  application  of  Lemma  2.2,  equation  (2.5)  then  yields  directly  the  pdf  for 
the  solution  fj^(t,N). 

We  now  consider  an  example  from  logistic  growth  model  which  illustrates  the  second  of  the  three  cases 
in  Lemma  2.2. 

Example  4.2.  Consider  the  logistic  growth  model, 


^  =  rN(K-N),  N(0)  =  Nq  >0  (4.8) 

Here  r,  K  are  random  parameters,  and  Nq  is  a  deterministic  constant. 

In  order  to  get  the  Jacobian,  consider  the  mapping  (Xqq,A)  =  (r,K)— »X  =  (N,r)  is  well-defined  and 
the  Jacobian  of  the  direct  mapping, 


-  I  a(r,K)  I-  [  I  <'■’> 


The  joint  density  of  N,  r  is  given  by, 


fWQe  — n; 


(4.10) 


Assume  that  r  and  K~i.i.d.  with  Unif(l,2).  Then,  the  joint  density  of  (N,r)  given  by  (4.8)  reduces  to, 

Nn^(e"‘-l)e"‘  NNn(l-e‘'‘) 

fN  r(^N,r)  =  ; - V  ;  1  <  r  <  2,  1  <  — 2 - ^  <  2.  (4.11 

(N  e^t_N)2  (N-N^e'^) 


Therefore,  the  marginal  density  function  of  N  can  be  obtained  as  follows; 


‘  NqC^-N  t(NQe2‘-N)  *  ^ 

1  r  Nne^^-N]  NneV-l)(N-Nn) 

iln  -^-7 -  -  t  ’  ^2<  N<  Ng- 

‘  N^e^-Nj  t(NQe2‘_N)(Noe‘_N)  ^ 

,.  r(Noe^^_N)(2-N)]  (2-N)  _  (N-Np) 

‘  J  t(Noe^‘_N)’  ^  '* 


fN(t.N)  = 


Nj  <  N  <  N2 


where  N ,  = - 7^ -  ,  No 

i  KT  1  \  I  1  ^ 


,  N,= 


Np(e2‘-l)-t-  r‘'2  Np(e‘-1) -*-1’ NQ(e2‘-l) 2*  ^  Np(e‘-l)-f-2 
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Example  I :  Case  1 :  t  <  In  2 
t  =  as 


Example  1 :  Case  2 :  t  >  In  2 
t  =  0. 75 


.07 


Example  2 ;  I  =  1 ,  NO  =  5 


I 


The  numerical  scheme  is  similar  to  the  one  described  in  Example  4.1.  The  results  are  shown  in  Fig.  2 
for  Nq=  5.  Again  the  numerical  solution  agrees  with  the  exact  solution  to  within  the  resolution  of  the 
plot. 
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1.  INTRODUCTION 

Singularly  perturbed  systems  with  two-time  scale  and,  more  generally,  multi-time  scale  systems, 
often  occur  naturally  due  to  the  presence  of  small  ”  parasitic”  parameters  multiplying  derivatives 
[3,8,9].  In  the  last  two  decades  the  method  of  asymptotic  expansion  which  is  based  on  order  reduction 
and  boundary  layer  corrections  has  been  widely  used  for  such  ssrstems  (7,9).  Recently,  an  alternative 
approach  has  been  developed,  where  one  develops  a  suitable  non-singular  linear  transformation  which 
partially  or  totally  decouples  the  original  system.  The  transformed  system  enables  one  to  study  the 
stability  of  original  system  with  relative  ease.  This  idea  was  initiated  by  Khalil  and  Kokotovic  [2]  for  a 
two-time  scale  problem  and  Chang[l]  for  a  general  boundary  value  problem.  Later,  Ladde  and 
Siyak[6],  Ladde  anbd  Rajaluksmi  [5],  Ladde  and  Kathirkamanayagan[4]  have  used  the  idea  for 
multiple  time-scale  and  multi-parameter  problems. 

In  this  paper,  a  procedure  to  totally  decouple  a  two-time  scale  singularly  perturbed  linew 
integro-dinerential  system  is  developed.  The  procedure  utilizes  Chang’s  transformation  [1]  in  a 
systematic  and  coherent  manner.  The  fast  and  slow  mode  decomposition  process  provides  a  very 
elegant  and  powerful  mechanisism  to  investigate  the  stability  and  approximation  analysis  of  the 
original  system  in  terms  of  an  auxiliary  system  corresponding  to  the  decoupled  system.  Furthermore, 
the  validity  of  the  transformation  is  also  discussed.  The  representation  of  the  transformation  in  terms 
of  the  given  coefficient  matrices  is  given.  Stability  and  the  approximation  to  the  solution  of  the 
original  system  are  also  investigated.  Finally,  an  example  illustrating  the  decoupling  procedure  and 
its  applicability  is  presented. 


This  research  reported  herein  was  supported  by  the  U.  S.  Army  Research  Office  Grant 
No.  DAAH04-93-G-0024  and  the  National  Security  Agency  Grant  No.  MDA904-93-H-2002. 
This  paper  was  presested  at  the  Tenth  Conference  in  this  series. 
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2.  PRELIMINARIES 


Consider  a  linear  time-varying  system  of  integro-difTerentiaJ  equations 

p  X  =  A  X,  x(to)  =  Xq 


where, 


'^ll  'I'l2 

Tji  T22 


0  f 


ep  U? 

,  X  =  [  Xj,  X2I  ;  Xj€R  for  i=l,2  and  n=  nj-»-  02; 


’^11  =  ^11  *^ll’  *^12  ~  ^12  +  ^12>  "^21  ~  ^21  ^21>  ^22  ~  ^22  +  *^22^ 

Ky's  are  linear  Volterra  operators  and  Ay’s  are  matrices;  €  >  0. 

We  assume  that 

Hj:  The  operators  T-’s  are  bounded  with  respect  to  t>  t^; 

H2:  Assume  that  T22  is  invertible  for  all  t  >tQ; 


The  e— boumdary  layer  system  can  be  obtained  by  putting  e  =  0  in  (2.1).  It  is  given  by, 

x’l  =  Til  *1  +  "’^12  *2-  ~  *10 

and  0  =  *^21  *1  "'^22  *2'  ^2(^0^  ~  *2  ^  *20  (2-2) 


Let  V  and  W  are  two  Volterra  operators  defined  as  follows: 
t 

Vx  =  D(t)  *  +  I  »(t»s)x(s)  ds 

‘0 

t 

Wy  =  E(t)  y  I  b(t,s)y(s)  ds  (2.3) 

*0 

where  D,  a  are  mxn  continuous  matrices,  E,  b  are  continuous  nxm  matrices  and  x€R",  yER*^. 

The  composition  of  V  and  W  are  defined  by: 

t  " 

(VW)(y)  =  V(Wy)  =  D(t)E(t)y  |  D(t)  b(t,s)  +  a(t,s)E(s)  + 

tQ- 

and  the  derivative  of  the  operator  V  is  defined  by, 

t 

(Vx)(t)  =  D(t)x  +  a(t,tg)xQ  +  |[  ajt.s)  -t-  as(t,s)^  x(s)  ds. 

‘0 

The  following  two  properties  of  the  derivatives  of  Volterra  operators  V  and  W  are  needed  in  our 
subsequent  discussion: 

(1)  The  derivative  of  the  Volterra  operator  V  satisfies 
Vx(t)]  =  (Vx)(t)  -»■  (Vx)(t)  where  x  = 
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(2)  The  derivative  of  the  composition  operator  of  V  and  W  satisfies 
VWx(t)]  =  V(Wx)(t)  V(Wx)(t). 

3.  DIAGONALIZATION  PROCESS 

In  this  section  our  prime  objective  is  to  develop  a  procedure  to  totally  decouple  the  original  system 
(2.1).  This  can  be  acheived  by  applying  a  transformation  which  decouples  the  fastest  state  variable  in 
the  coupled  two-time  scale  system.  The  validity  of  such  transformations  will  be  discussed  in  the 
succeeding  section.  The  following  procedure  briefly  explains  the  method  to  totally  decouple  the  original 
system  (2.1). 

We  consider  the  following  transformation  S:  C^x  C2-*C^x  C2  defined  by 


[■‘7 


-(ML 


-cM  1 


1*5 

where  L  and  M  are  unknown  linear  Volterra  operators  which  are  functions  of  time;  C-=C[R,R  ']  and 
Ij—  mxm  identity  matrix,  I2—  nxn  identity  matrix. 

We  note  that  the  inverse  of  S  is  given  by 


S“^= 


r  1,  fM  1 

L  -I 


Now,  we  can  apply  the  transformation  to  the  system  (2.1)  with  m=nj,  n=n2: 


Z  =  SX 


Z  =  (ss~4  S/i~Us“^)  z 


Here,  Z  =  (zi,Z2)^,  Zj  e  R  ‘  for  i=l,2; 


Set,  SS  S;i  ^AS  ^  =  P,  where  P  = 


**11  **12 
P21  P22 


Choose  L  and  M  in  (3.1)  so  that  Pj2=0  and  P2j=0.  These  two  identities  leads  to  the  following: 

Pi2=  0  implies  that,  eL  =  T22L  — fLTjj-H  €LTi2b— T21  (3.4) 

P2i=0  implies  that,  cM  =-MT22+f(Tii— Ti2L)M— tMLTi2+Tj2  (3.5) 

Under  these  conditions  (3.4)  and  (3.5),  Pjj  and  P12  can  be  written  as: 

P^l  —  (Tjj  Tj2b)  and  P22  ~  **'^12'^  ^  ^'^22'  (3‘6) 


Hence  the  system  (3.3)  reduces  to: 
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(3.7) 


*i]  -  r^ii*'^x2^ 

So  0 

•J 


0 


+  LT 


»(‘o)  =  *0 


4.  VALIDITY  AND  APPROXIMATION  OF  THE  TRANSFORMATIONS 
In  order  to  establish  the  validity  of  the  transformation,  we  will  establish  the  existence,  uniqueness, 
boundedness  and  other  fundamental  properties  of  the  following  abstract  Cauchy  problem  in  L  and  M. 

fL  —  "^22^  ^  fL(Tj2L — Tjj) — T2j 

fM  =  -MT22  +  e(T22-Ti2L)M  -tMLTi2+  T12  (4-1) 

with  initial  conditions, 

M*’o)  =  ^22  ^(^0^  '^2x(^o) 

M{tQ)  =  Ti2(to)  T22  ^(to). 

From  continuity  of  the  matrices  and  kernels  in  (2.1)  and  continuous  differentiability  of  the  right  hand 
side  of  (2.1)  relative  to  L  and  M,  the  existence  and  uniqueness  of  the  problem  (4.1)  follows 
immediately. 


Remark  4.1:  The  sufficient  conditions  to  establish  the  inverse  of  T22  can  be  given  as  follows: 

t 

Define,  T22U  =  A22(t)  “  +  |  K22(t,s)  u(s)ds 

to 

00 

and,  T22~^u  =  C(t)  u  +  |  F(t,s''  u(s)  ds 
Then,  0 

(i)  C(t)  =  A22~^(t)  and  F(t,s)  satisfies  the  integral  equations 

t 

(ii)  A22(t)  F(t,s)  +  K22(t,s)C(s)  +  |K22(t,s)  F(C)F{C,s)dC  =0  and 

s 

t 

(iii)  C(t)  K22(t,s)  +  F(t,s)  A22(s)  |F(t,C)  K22(C,s)  dC  =0,  (4.2) 

s 

We  need  the  following  assumptions  in  order  to  establish  the  results. 

(H3):  T22  ^T2i,  T12T22  ^  and  their  derivatives  are  bounded  on  [to,oo). 

(H^):  i'(T22)  <  “Ofi  a  >0  where  i/(T22)  =  lini  sup  1  -  is  the  logarithmic  norm  of 

h^o'*'  ^ 

the  linear  Volterra  operator  T22. 
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Theorem  4.1:  Under  the  aasumptioiu  (Hj)—(H4),  the  abstract  Cauchy  problem  (4.1)  he*  atmoet  one 
solution  (L,M)  existing  on  [  tg.oo). 

Moreover, 

L(t)  =  L(t)  +  0(0 

M(t)  =  M(t)  +  0(0  (4.3) 

where 

L  —  — ^22  ^^21  and  M  —  ^12^22 

Proof:  The  proof  of  the  theorem  can  be  formulated  analogous  to  the  result  in  [5]  with  certain 

modifications. 


5  STABILITY  RESULTS 


In  this  section,  we  establish  the  main  result  concerning  the  approximate  solution  and  the  stability  of 
the  original  system  (2.1).  An  approximate  solution  of(2.1)  can  be  obtained  as  follows: 


The  totally  decoupled  system  of  (2.1)  can  be  written  as. 


Uj  —  (Tjj  — Ti2L)ui+  0(c),  Uj(tQ)  —  Uj® 

and  U2  =  (cLTi2+T22)u2+  0(«).  U2(*'o)  =  **2° 


(5.1) 


We  note  that  system  (5.1)  can  be  considered  as  a  perturbed  system  [6]  of 
'"l  =  (  Tii-TjjL)  vj  +  0(c),  Vi(to)  =  vi® 

and  cv2=  (cLTi2+  T22)  ''2  +  0(c),  V2(to)  =  V2®  (5.2) 

where  v®=  T^Xq. 

This  system  can  be  considered  as  an  auxiliary  system  of  (5.1).  We  need  an  additional  assumption  to 
establish  our  results. 


(H5)= 


lim  sup 
t— 00 


WT„-T,jL)(s)d, 

^0 


< 


—  aj<  0. 


Lemma  5.1:  Let  the  assumptions  Hg,  and  hold.  Then,  one  can  chose  c*  >  0  such  that  for  all 

f  < 

(i)  The  trivial  solution  of  (5.1)  is  exponentially  asymptotically  stable. 

(ii)  Ui=  Vj+  0(c),  U2=  V2+  0(c). 
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Pfoof:  The  proof  of  the  lenune  can  be  f<r)nnulated  analogous  to  the  result  in  [6]  with  certain 

modifications. 


Theorem  5.1;  Under  the  assumptions  of  Themem  4.1  and  Lemma  5.1, 

(i)  The  trivial  solution  of  (2.1)  is  exponentiallly  asymptotically  stable. 

(ii)  The  solution  of  (2.1)  can  be  approximated  by  S~^s,  where  s  is  the  solution  of  (5.2),  that  is 
x(t)  =  S~^i  +  0(«),  where 


S“^= 


-fM  1 

Ij-fLMj 


Proof:  The  proof  of  the  lemma  can  be  formulated  analogous  to  the  result  in  [4]  with  certain 

modifications. 


Example  5.1;  Consider  the  following  system. 


1  0 

0  < 


‘I'll  "^12 


S]  I  T2.  [Jj]  ' 


t  t 

where  Tji  x  =  ^  +  |  ~9«  *^x(®)  <1®)  =  — 5x  +  1 3e  *^x(s)ds, 


T21X  =  — 3x  +  J  — 2e  *^x(s)  ds,  T22X  =  — 5x  +|  — 2e  *^x(8)  ds. 


We  obtain  the  following  : 

(i)  The  operator  T22  is  invertible  and  the  inverse  operator  of  T22, 
T22  =  -iu  +  J  ^e  ^  u(s)  ds. 


(ii)  The  Logarithmic  Norm  of  the  transformation  T22  i®  given  by, 

^(”^22)  ^ 

(iii)  The  solution  of  the  Cauchy  problem  (4.1)  is  given  by, 

L  =  L  +  o(e) 

M  =  M  +  o(f),  where  L  and  M  are  given  by. 
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Lu  =  u(8)  d., 

*0 

and  Mu  =  u  +  |^  ^  “(®)  *•*• 

^0 

(iv)  The  totally  decoupled  system  can  be  obtained  as, 

i"  + 1  [- 

‘o  t 

and  fUj  =  — 5u2  +  — 2e  ^U2(®) 

*0 

(v)  The  system  in  (iv)  can  be  considered  as  a  perturbed  system  of, 

V  -  5v  +fr  n7--5(‘-«>  192  16711.-7(1-8)1  . 

—  2''!  +  J  L“  IS"*  "!»*  ■  im^ 

‘o  t 

f  -2(t-s) 

and  V2  =  —  5v2  +  I  —  2e  ^''2(5)  ds  (5-6) 

*0 

with  Vj(to)  =  Vj®,  and  v2(tQ)  =  V2®. 

(vi)  The  solution  of  the  original  system  by  applying  the  main  results  can  be  approximated  by, 

Xi(t)  =Vi(t)  +  o(e) 

X2(t)  =  -Lvi(t)  +  V2(t)  +  o(f)  (5.7) 

where  Vj(t),  V2(t)  are  the  solutions  of  (v). 

Note  that  the  Logarithmic  Norm  of  r^,  i/(rg^)  <  —0.304,  where 


=-i‘ 


[L  117^-5(t-s)_192^  167ne“'^(‘'®)lu(s)  ds 

J  L  26  199  2990  J  1  1 


implies  that  the  trivial  solution  of  the  original  system  (5.4)  is  exponentially  asymptotically  stable. 


The  details  of  the  proofs  of  these  presented  results  will  appear  elsewhere. 
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